Best Available Copy 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
11 July 2002 (11.07.2002) 




PCT 



(10) International Publication Number 

WO 02/054069 Al 



(51) International Patent Classification 7 : 



G01N 33/53 



(2 1 ) International Application Number: PCT/US0 1/49808 



(22) International Filing Date: 

26 December 2001 (26.12.2001) 



(25) Filing language: 



(26) Publication Language: 



English 



English 



(30) Prioritx Data: 
(»( 1/25* l. .*7»i 
lO/l)2(».IXX 



29 December 2000 (29. 1 2.2000) US 
21 December 2001 (21.12.2001) US 



(71) Applicant </<»/- u// designated States except US): THE 
REG E NTS OF TH E UNIVERSITY OF CALIFORNIA 

1 1 iS/l IS|: 111 Franklin Street, 5th, Oakland, CA 94607 
(US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): ZUKJER, Charles, 

S. |US/US|: 477S Thurston Place, San Diego, CA 92130 
(US). ZHANG, Yifeng |CN/US]; 4083-D Miramar Street, 
U Jolly. CA 92037 < US). 



(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NO, NZ, OM, PH, PL, PT, RO, RU, SD, SE, SG, 
SI, SK, SL, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, 
VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, 
GB, GR, IE, IT, LU, MC, NL, PT, SE, TR), OAPI patent 
(BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, ML, MR, 
NE, SN, TD, TG). 

Published: 

— with international search report 

— before the expiration of the time limit for amending the 
claims and to be republished in the event of receipt of 
amendments 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations*' appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



(74) Agents: PARENT, Annette, S. et al.; Townscnd and 
Townsend and Crew. LLP, Two Embarcadcro Center, 8th 
Hoor, San l-rancisco. CA 941 1 1-3834 (US). 



ON 



(54) Title: ASSAYS FOR TASTE RECEPTOR CELL SPECIFIC ION CHANNEL 

O < 57 > Abstract: The invention includes nucleic acid and amino acid sequences of a taste <:ell-specific ion channel subunit that is 
^ specifically expressed in taste cells, antibodies to such subunits, methods of detecting such nucleic acids and proteins, and methods 
^* of screening for modulators of taste cell specific ion channel subunits. 



Best Available Copy ■ ^ 



WO 02/054069 PCT/USO 1/49808 



ASSAYS FOR TASTE RECEPTOR CELL SPECIFIC ION CHANNEL 

CROSS-REFERENCES TO RELATED APPLICATIONS 
The present application claims priority to USSN 60/259,379, filed 
December 29, 2000, herein incorporated by reference in its entirety. 



STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER 
FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT 
10 This invention was made with Government support under Grant No. 5R01 

DC03160, awarded by the National Institutes of Health. The Government has certain 

rights in this invention. 

FIELD OF THE INVENTION 
! 5 The invention identifies nucleic acid and amino acid sequences of an ion 

channel protein that is specifically expressed in taste cells, antibodies to the taste cell 
specific ion channel protein, methods of detecting such nucleic acids and subunits, and 
methods of screening for modulators of said taste cell specific ion channel. 

20 BACKGROUND OF THE INVENTION 

Taste transduction is one of the most sophisticated forms of 
chemotransduction exhibited by animals, from simple metazoans to the most complex of 
vertebrates. See, e.g., Avenet & Lindemann, J. Membrane Biol. 112:1-8 (1989); 
Margolskee, BioEssays 15:645-650 (1993); Gilbertson, T. Hie physiology of vertebrate 

25 taste reception 3, 532-539 (1993); Kinnamon, S.C. and Margolskee, R.F. (1996), Curr. 

Opm. Neurobiol., 4. 506-513; Roper, S.D. Ann. Rev. Neurosci. 12, 329-353 (1989); Hoon 
et al. (1999), Cell 96, 541-51; Adler et al. (2000), Cell 100:693-702; Chandrashekar et al. 
(2000), Cell 100:703-71 1. It enables animals to reliably detect and appropriately respond 
to chemical compounds present in their environment: for example, avoid toxic 

30 substances, foods, environments and enemies; or identify edible foods or sources of food, 
livable environments, and familiar or compatible individuals. 
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Higher organisms generally are able to discriminate between four basic 
types of taste modalities: salty, sour, sweet, and bitter. Mammals reportedly have five 
basic taste modalities: sweet, bitter, sour, salty and unami (the taste of monosodium 
glutamate) (see, e.g., Kawamura & Kare, Introduction to Unami: A Basic Taste (1987); 

5 Kinnamon & Cummings, Ann. Rev. Physiol. 54:715-731(1992); Lindemann, Physiol. Rev. 
76:718-766 (1996); Stewart et al., Am. J. Physiol. 272:1-26 (1997)). Each of these 
modalities is thought to be mediated by distinct signaling pathways leading to receptor 
cell depolarization, generation of a receptor or action potential, and the release of 
neurotransmitter and synaptic activity (see, e.g., Roper„4wz. Rev. Neurosci. 12:329-353 

10 (1989)). 

Extensive psychophysical studies in humans have reported that different 
regions of the tongue display different gustatory preferences (see, e.g., Hoffmann, 
Menchen. Arch. Path. Anat. Physiol. 62:516-530 (1875); Bradley et al, Anatomical 
Record 212: 246-249 (1985); Miller & Reedy, Physiol. Behav. 47:1213-1219 (1990)). 
15 Also, numerous physiological studies in animals have shown that taste receptor cells may 
selectively respond to different tastants (see, e.g., Akabas et al, Science 242:1047-1050 
(1988); Gilbertson et al., J. Gen. Physiol. 100:803-24 (1992); Bernhardt et al., J. Physiol. 
490:325-336 (1996); Curmriings et al.,J..Neurophysiol. 75:1256-1263 (1996)). 

In mammals, taste receptor cells are assembled into taste buds that are 
20 distributed into different papillae in the tongue epitheUum. Each taste bud, depending on 
the species, contain 50-150 cells, including precursor cells, support cells, and taste 
receptor cells (see, e.g., Lindemann, Physiol. Rev. 76:718-766 (1996)). Receptor cells are 
innervated at their base by afferent nerve endings that transmit information to the taste 
centers of the cortex through synapses in the brain stem and thalamus. 
25 Circumvallate papillae, found at the very back of the tongue, contain 

hundreds (mice) to thousands (human) of taste buds and are particularly sensitive to bitter 
substances. Foliate papillae, localized to the posterior lateral edge of the tongue, contain 
dozens to hundreds of taste buds and are particularly sensitive to sour and bitter 
substances. Fungiform papillae containing a single or a few taste buds are at the front of 
30 the tongue and are thought to mediate much of the sweet taste modality. 

Elucidating the mechanisms of taste cell signaling and information 
processing are critical for understanding the function, regulation, and "perception" of the 
sense of taste. Although much is known about the psychophysics and physiology of taste 
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cell function, very little is known about the molecules and pathways that mediate these 
sensory signaling responses (reviewed by Gilbertson, Current Opn. in Neurobiol. 3:532- 
539 (1993)). Electrophysiological studies suggest that sour and salty tastants modulate 
taste cell function by direct entry of H+ and Na + ions through specialized membrane 
channels on the apical surface of the cell. In the case of sour compounds, taste cell 
depolarization is hypothesized to result from H+ blockage of K + channels {see, e.g., 
Kinnamon et al, PNAS USA 85: 7023-7027 (1988)) or activation of pH-sensitive 
channels (see, e.g., Gilbertson et al, J. Gen. Physiol. 100:803-24 (1992)); salt + 
transduction may be partly mediated by the entry of Na + via amiloride-sensmve Na 
channels (see, e.g., Heck et al, Science 223:403-405 (1984); Brand et al., Brain Res. 207- 
214 (1985); Avenet etal, Nature 331:351-354 (1988)). Most of molecular components 
of the sour or salty pathways have not been identified. 

Sweet, bitter, and unami transduction are believed to be mediated by G- 
protein-coupled receptor (GPCR) signaling pathways (see, e.g. , Striem et al, Biochem. J. 
260:121-126 (1989); Chaudhari et al, J. Neuros. 16:3817-3826 (1996); Wong et al, 
Nature 3S1 :796-800 (1996)). Confusingly, there are almost as many models of signaling 
pathways for sweet and bitter transduction as there are effector enzymes for GPCR 
cascades (e.g. , G protein subunits, cGMP phosphodiesterase, phospholipase C, adenylate 
cyclase; see, e.g., Kinnamon & Margolskee, Curr. Opin. Neurobiol 6:506-513 (1996)). 
Identification of molecules involved in taste signaling is important given the numerous 
pharmacological and food industry applications for bitter antagonists, sweet agonists, and 

modulators of salty and sour taste. 

The identification and isolation of taste receptors (including taste ion 
channels), and taste signaling molecules, such as G-protein subunits, ion channels and 
enzymes involved in signal transduction, would allow for the pharmacological and 
genetic modulation of taste transduction pathways. For example, availability of receptor, 
ion channels, and other molecules involved in taste transduction would permit the 
screening for high affinity agonists, antagonists, inverse agonists, and modulators of taste 
cell activity. Such taste modulating compounds could then be used in the pharmaceutical 
0 and food industries to customize taste. In addition, such taste cell specific molecules can 
serve as invaluable tools in the generation of taste topographic maps that elucidate the 
relationship between the taste cells of the tongue and taste sensory neurons leading to 
taste centers in the brain. 
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SUMMARY OF THE INVENTION 
The present invention demonstrates, for the first time, taste cell-specific 
expression of nucleic acids encoding an ion channel subunit. The taste cell-specific ion 
5 channel subunits that are specifically expressed in taste cells can thus be used to screen 
for modulators of taste cell function and to control taste perception. The compounds 
identified by these assays would then be used by the food and pharmaceutical industries 
to customize taste, e.g., as additives to food or medicine so that the food or medicine 
tastes different to the subject who ingests it. For example, bitter medicines can be made 
10 to taste less bitter, and sweet substance can be enhanced. 

Using isolated, hand-dissected taste buds and papillae from the rat 
circumvallate papillae, subtracted cDNA libraries against non-taste lingual tissue were 
generated and screened for sequences preferentially expressed in taste receptor cells. 
Clones representing differentially expressed genes were isolated, mapped by in situ 
15 hybridization to single taste receptor cells, and used as probe to isolate and characterize 
full length cDNA sequences. This procedure led to the isolation of a novel taste-specific 
ion channel. 

In one aspect, the present invention provides a method for identifying a 
compound that modulates transduction of taste signals in taste cells, the method 
20 comprising the steps of: (i) contacting the compound with a eukaryotic host cell or cell 
membrane in which has been expressed a taste cell-specific ion channel subunit having 
(a) greater than about 70% amino acid sequence identity to a polypeptide having a 
sequence of SEQ ID NO:2, SEQ ID NO:5 or SEQ ID NO:8; and (b) specifically binding 
to polyclonal antibodies generated against SEQ ED NO:2, SEQ ID NO:5 or SEQ ID 
25 NO: 8; and (ii) determining a functional effect of the compound upon the cell or cell 
membrane expressing the taste cell-specific ion channel subunit. 

In one embodiment, the functional effect is determined by measuring 
changes in intracellular cAMP, cGMP, IP 3 , DAG, or Ca 2+ . 

In another embodiment, the functional effect is determined by measuring 
30 changes in the level of phosphorylation of taste cell specific proteins. 

In another embodiment, the functional effect is determined by measuring 
changes in transcription levels of taste cell specific genes. 
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In another embodiment, the taste cell specific ion channel subunits are 

recombinant. 

In other embodiments, the taste cell-specific ion channel subunit have an 
amino acid sequence of SEQ ID NO:2, SEQ ID NO:5 or SEQ ID NO:8. 

In another aspect, the present invention provides a method identifying a 
compound that modulates taste signaling in taste cells, the method comprising the steps 
of: ® expressing a taste cell-specific ion channel subunit in an HEK 293 host cell, 
wherein the taste cell-specific ion channel subunit: (a) has greater than about 70% amino 

acid sequence identity to a polypeptide having a sequence of SEQ ID NO:2, SEQ ID 
NO:5 or SEQ ID NO:8; and (b) specifically binds to polyclonal antibodies generated 

against SEQ ID NO:2, SEQ ID NO:5 or SEQ ID NO:8; (ii) contacting the host cell with 

the compound that modulates taste signaling in taste cells; and (iii) detenmning changes 

in intracellular calcium levels in the host cell. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is a picture of an in situ hybridization of a nucleic acid of the 
present invention to a tissue section, demonstrating that the nucleic acid is specific for 
taste cells. 

DETAILED DESCRIPTION OF THE INVENTION 

I. Introduction 

The present invention relates to nucleic acids that encode taste cell-specific 
ion channel subunits that are specifically expressed in taste cells. These nucleic acids and 
the polypeptides that they encode are referred to as "TC-ICS" for "taste cell specific ion 
channel subunit." These taste cell specific nucleic acids and polypeptides are components 
of the taste transduction pathway. 

The invention provides methods of screening for modulators, e.g., 
activators, inhibitors, stimulators, enhancers, agonists, and antagonists of TC-ICS. Such 
modulators of taste transduction are useful for pharmacological and genetic modulation of 
taste signaling pathways. These methods of screening are used to identify high affinity 
agonists and antagonists of taste cell activity. These modulatory compounds can then be 
used in the food and pharmaceutical industries to customize and/or regulate taste. The 
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modulatory compounds typically would be added to a food or medicine, thereby altering 
its taste to the subject who ingests it. 

Thus, the invention provides assays for taste modulation, where TC-ICS 
acts as a direct or indirect reporter molecule for the effect of modulators on taste 
5 transduction. TC-ICS are used in assays, e.g., to measure changes in ion concentration, 
membrane potential, current flow, ion flux, transcription, signal transduction, receptor- 
ligand interactions, second messenger concentrations, in vitro, in vivo, and ex vivo. In 
one embodiment, a TC-ICS is recombinantly expressed in cells, and modulation of taste 
transduction is assayed by measuring changes in Ca 2+ levels (see Example II). Methods 
10 of assaying for modulators of taste transduction include oocyte or tissue culture cell 
expression of TC-ICS; transcriptional activation of TC-ICS; phosphorylation and 
dephosphorylation of TC-ICS; ligand binding assays; voltage, membrane potential and 
conductance changes; ion flux assays; changes in intracellular second messengers such as 
cAMP and inositol triphosphate; changes in intracellular calcium levels; and 
15 neurotransmitter release. 

These nucleic acids and proteins also provide valuable probes for the 
identification of taste cells, as the nucleic acids are specifically expressed in taste cells. 
For example, probes for TC-ICS are used to identify subsets of taste cells such as foliate 
cells and circumvallate cells, or specific taste receptor cells, e.g., sweet, sour, salty, and 
20 bitter. They also serve as tools for the generation of taste topographic maps that elucidate 
the relationship between the taste cells of the tongue and taste sensory neurons leading to 
taste centers in the brain. Furthermore, the nucleic acids and the proteins they encode are 
used as probes to dissect taste-induced behaviors. 

Finally, the invention provides for methods of detecting TC-ICS nucleic 
25 acid and protein expression, allowing investigation of taste transduction regulation and 
specific identification of taste receptor cells. TC-ICSs also provide useful nucleic acid 
probes for paternity and forensic investigations. TC-ICSs are useful nucleic acid probes 
for identifying subpopulations of taste receptor cells such as foliate, fungiform, and 
circumvallate taste receptor cells. TC-ICS polypeptides can also be used to generate 
30 monoclonal and polyclonal antibodies useful for identifying taste receptor cells, e.g. , in 
imrnuno histochemical assays. Taste receptor cells can also be identified using 
techniques such as reverse transcription and amplification of mRNA, isolation of total 
RNA or poly A + RNA, northern blotting, dot blotting, in situ hybridization, RNase 
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protection, SI digestion, probing high density oligonucleotide arrays, western blots, and 
the like. 

Functionally, TC-ICS represents a subunit of an ion channel involved in 
taste transduction. Structurally, the nucleotide sequence of TC-ICS (including SEQ ID 
5 NO:l, SEQ ID NO:4 or SEQ ID NO:7, and also any sequence that encodes SEQ ID 
NO:2, SEQ ID NO:5, or SEQ ID NO:8) encodes a polypeptide of approximately 1125 
amino acids. Related TC-ICS genes from other species share at least about 70% amino 
acid identity over an amino acid region at least about 25 amino acids in length, preferably 
50 to 100 amino acids in length. In situ hybridization demonstrates tissue and cell-type 

10 specificity in taste buds. 

Specific regions of the TC-ICS nucleotide and amino acid sequences are 
used to identify polymorphic variants, interspecies homologs, and alleles of TC-ICS. 
Especially useful are unique subsequences of SEQ ID NO:l, SEQ ID NO:4 and SEQ ID 
NO-7 that are at least 20, preferably at least 30, more preferably at least 50, most 
preferably at least 100 nucleotides long and that have at least 90-95% sequence homology 
with a subsequence present in SEQ ID NO: 1, SEQ ID NO:4 and SEQ ID NO:7. Also 
especially useful are unique subsequences of SEQ ID NO:3 and SEQ ID NO:6 that are at 
least 20, preferably at least 30, more preferably at least 50, most preferably at least 100 
nucleotides long and that have at least 90-95% sequence homology with a subsequence 
20 present in SEQ ID NO:3 and SEQ ID NO:6. This identification are made in vitro, e.g., 
under stringent hybridization conditions or with PCR and sequencing, or by using the 
sequence information in a computer system for comparison with other nucleotide or 
amino acid sequences. Typically, identification of polymorphic variants and alleles of 
TC-ICS is made by comparing an amino acid sequence of about 25 amino acids or more, 
25 preferably 50-100 amino acids. Amino acid identity of approximately at least 70% or 
above, preferably 80%, most preferably 90-95% or above typically demonstrates that a 
protein is a polymorphic variant, interspecies homolog, or allele of TC-ICS. Sequence 
comparisons are performed using any of the sequence comparison algorithms discussed 
below. Antibodies that bind specifically to TC-ICS or a conserved region thereof can 
30 also be used to identify alleles, interspecies homologs, and polymorphic variants. 

Polymorphic variants, interspecies homologs, and alleles of TC-ICS are 
confirmed by examining taste cell specific expression of the putative TC-ICS 
polypeptide. Typically, TC-ICS having the amino acid sequence of SEQ ID NO:2, SEQ 
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ID NO:5, or SEQ ID NO:8 is used as a positive control, e.g., in immunoassays using 
antibodies specifically directed against a protein having the amino acid'sequence of SEQ 
ID NO:2, SEQ ID NO:5, or SEQ ID NO:8, in comparison to the putative TC-ICS protein 
to demonstrate the identification of a polymorphic variant or allele of TC-ICS. 
5 Alternatively, TC-ICS having the nucleic acid sequences of SEQ ID NO:l, SEQ ID 
NO:4, or SEQ ID NO:7 is used as a positive control, e.g., in in situ hybridization with 
SEQ ID NO:l, SEQ ID NO:4, or SEQ ID NO:7, in comparison to the putative TC-ICS 
nucleotide sequences to demonstrate the identification of a polymorphic variant or allele 
ofTC-ICS. 

1 o TC-ICS nucleotide and amino acid sequence information may also be used 

to construct models of taste cell specific polypeptides in a computer system. These 
models are subsequently used to identify compounds that can activate or inhibit TC-ICS. 
Such compounds that modulate the activity of TC-ICS are used to investigate the role of 
TC-ICS in taste transduction or are used as therapeutics. 

1 5 Identification of taste cell specific expression of TC-ICS provides a means 

for assaying for inhibitors and activators of taste cell activity. TC-ICS is useful for 
testing taste modulators using in vivo and in vitro expression that measure, e.g., 
transcriptional activation of TC-ICS; ligand binding; phosphorylation and 
dephosphorylation; binding to G-proteins; G-protein activation; regulatory molecule 

20 binding; voltage, membrane potential and conductance changes; ion flux; intracellular 
second messengers such as cAMP and inositol triphosphate; intracellular calcium levels; 
and neurotransmitter release. Such activators and inhibitors identified using TC-ICS are 
used to further study taste transduction and to identify specific taste agonists and 
antagonists. Such activators and inhibitors are useful as pharmaceutical and food agents 

25 for customizing taste. 

Methods of detecting TC-ICS nucleic acids and expression of TC-ICS are 
also useful for identifying taste cells and creating topological maps of the tongue and the 
relation of tongue taste receptor cells to taste sensory neurons in the brain. Furthermore, 
these nucleic acids are used to diagnose diseases related to taste by using assays such as 

30 northern blotting, dot blotting, in situ hybridization, RNase protection, and the like. 
Chromosome localization of the genes encoding human TC-ICS can also be used to 
identify diseases, mutations, and traits caused by and associated with TC-ICS. 
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Techniques, such as high density oligonucleotide arrays (GeneChip™), are used to screen 
for mutations, polymorphic variants, alleles and interspecies homologs of TC-ICS. 

II. Definitions 

As used herein, the following terms have the meanings ascribed to them 
unless specified otherwise. 

"Taste receptor cells" are neuroepithelial cells that are organized into 
groups to form taste buds of the tongue, e.g., foliate, fungiform, and circumvallate cells 
(see, e.g., Roper et al, Ann. Rev. Neurosci. 12:329-353 (1989)). 

"Taste cell specific" genes or proteins refer to those which are expressed 
exclusively, or preferentially, in the taste receptor cells but not in non-taste cells, or in 
subsets of Gustducin positive cells. 

"Taste cell-specific ion channel subunit" or "TC-ICS" refers to a family of 
taste cell-specific ion channel subunits that are specifically expressed in taste receptor 
cells such as foliate, fungiform, and circumvallate cells. The family includes proteins 
having the amino acid sequences of, e.g., SEQ ID NO:2, SEQ ID NO:5, or SEQ ID NO:8, 
and they are encoded by cDNAs having the sequences of SEQ ID NO: 1, SEQ ID NO:4, 
or SEQ ID NO:7 and by genomic sequences such as, for example, SEQ ID NO:3 or SEQ 
ID NO:6. Such taste cells can be identified because they express molecules such as 
Gustducin, a taste cell specific G-protein (McLaughin et aU Nature 357:563-569 (1992)). 
Taste receptor cells can also be identified on the basis of morphology (see, e.g., Roper, 
supra). TC-ICS nucleic acids encode a taste cell-specific ion channel subunit with the 
ability to form a functional ion channel. 

The term TC-ICS therefore refers to polymorphic variants, alleles, 
mutants, and interspecies homologs that: (1) have about 70% amino acid sequence 
identity, preferably about 85-90% amino acid sequence identity to SEQ ID NO:2, SEQ ID 
NO:5, or SEQ ID NO:8 over a window of about 25 amino acids, preferably 50-100 amino 
acids; (2) bind to antibodies raised against an immunogen comprising an amino acid 
sequence of SEQ ID NO:2, SEQ ID NO:5, or SEQ ID NO:8 and conservatively modified 
variants thereof; or (3) specifically hybridize (with a size of at least about 500, preferably 
at least about 900 nucleotides) under stringent hybridization conditions to a nucleic acid 
having SEQ ID NOS: 1, 4, or 7, and conservatively modified variants thereof. 
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"TC-GPCR" refers to a G-protein coupled receptor that is specifically 
expressed in taste receptor cells such as foliate, fungiform, and circumvallate cells. Such 
taste cells can be identified because they express molecules such as Gustducin, a taste cell 
specific G-protein (McLaughin et al, Nature 357:563-569 (1992)). Taste receptor cells 
5 can also be identified on the basis of morphology {see, e.g., Roper, supra). TC-GPCR 
generally have seven transmembrane regions that have "G-protein coupled receptor 
activity," e.g., they bind to G-proteins in response to extracellular stimuli and promote 
production of second messengers such as LP 3 , cAMP, and Ca 2+ via stimulation of enzymes 
such as phospholipase C and adenylate cyclase (for a description of the structure and 
10 function of G-protein coupled receptors, see t e.g. , Fong, supra, and Baldwin, supra). 

A "host cell" is a naturally occurring cell or a transformed cell that 
contains an expression vector and supports the replication or expression of the expression 
vector. Host cells may be cultured cells, explants, cells in vivo, and the like. Host cells 
may be prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, 
15 amphibian, or mammalian cells such as CHO, HeLa, HEK 293 and the like. 

"Biological sample" as used herein is a sample of biological tissue or fluid 
that contains nucleic acids or polypeptides of TC-ICS. Such samples include, but are not 
limited to, tissue isolated from humans, mice, and rats, in particular, tongue. Biological 
samples may also include sections of tissues such as frozen sections taken for histological 
20 purposes. A biological sample is typically obtained from a eukaryotic organism, such as 
insects, protozoa, birds, fish, reptiles, and preferably a mammal such as rat, mouse, cow, 
dog, guinea pig, or rabbit, and most preferably a primate such as chimpanzees or humans. 
Preferred tissues include tongue tissue, isolated taste buds, and testis tissue. 

The phrase "functional effect" in the context of assays for testing 
25 compounds that modulate TC-ICS-mediated taste transduction includes the determination 
of any parameter that is indirectly or directly under the influence of TC-ICS. It includes 
changes in ion flux, membrane potential, current flow, transcription, G-protein binding, 
GPCR phosphorylation or dephosphorylation, signal transduction, receptor-ligand 
interactions, second messenger concentrations {e.g., cAMP, IP3, or intracellular Ca 2+ ), in 
30 vitro, in vivo, and ex vivo and also includes other physiologic effects such increases or 
decreases of neurotransmitter or hormone release. 

A "pharmacologically effective amount of a composition that modulates 
taste signaling by an ion channel subunit" is an amount of a composition (which may 
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consist of a single chemical compound or a mixture of chemical compounds, preferably 
combined with a carrier such as a solvent) that is effective to detectably alter a 
measurable property or a functional effect of a TC-ICS. The TC-ICS may be in soluuon, 
or expressed in a naturally occurring cell, in a tissue cultured cell, in a recombinant cell, 
5 or in a wild type or recombinant organism. In a main embodiment, the TC-ICS is present 
at least at the surface membrane of a cell which may be in a tissue culture or in a tissue of 
a live multicellular organism, especially a mammal. The precise value of the effective 
amount varies according to the compound and the species, age, sex, condition and health 
of the cells or organism that contains the TC-ICS. In solution, typically the effective 
10 amount is an amount sufficient to yield a concentration of at least 10 nM - 10 mM, 
preferably at least 0.1 uM to 1 mM, and more preferably 10 to 100 uM. When 
administered to a subject, the effective amount is from about 1 ng/kg to 10 mg/kg for a 
typical subject. 

"Determining the functional effect" denotes assays for a compound that 
15 increases or decreases a parameter that is indirectly or directly under the influence of TC- 
ICS. Such functional effects are measured by any means known to those skilled in the 
art e g., patch clamping, voltage-sensitive dyes, whole cell currents, radioisotope efflux, 
inducible markers, oocyte or tissue culture cell expression of TC-ICS; transcriptional 
activation of TC-ICS; ligand binding assays; voltage, membrane potential and 
20 conductance changes; ion flux assays; changes in intracellular second messengers such as 
cAMP and inositol triphosphate (IPs); changes in intracellular calcium levels; 
neurotransmitter release, and the like. 

"Inhibitors," "activators," and "modulators" of TC-ICS refer to inhibitory 
or activating molecules identified using in vitro and in vivo assays for taste transduction, 
25 e.g., ligands, agonists, antagonists, and their homologs andmimetics. Inhibitors are 
compounds that decrease, block, prevent, delay activation, inactivate, desensitize, or 
down regulate taste transduction, e.g., antagonists. Activators are compounds that 
increase, open, activate, facilitate, enhance activation, sensitize or up regulate taste 
transduction, e.g., agonists. Modulators include genetically modified versions of TC-ICS, 
30 e«, with altered activity, as well as naturally occurring and synthetic ligands, 

antagonists, agonists, small chemical molecules and the like. Such assays for inhibitors 
and activators include, e.g., expressing TC-ICS in cells or cell membranes, applying 
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putative modulator compounds, and then determining the functional effects on taste 
transduction, as described above. Compounds identified by these assays are typically 
combined with food or medicine and used to alter its taste to the subject (mammalian, 
preferably a human) who ingests it. 

5 Samples or assays comprising TC-ICS that are treated with a potential 

activator, inhibitor, or modulator are compared to control samples without the inhibitor, 
activator, or modulator to examine the extent of inhibition. Control samples (untreated 
with inhibitors) are assigned a relative TC-ICS activity value of 100%. Inhibition of TC- 
ICS is achieved when the TC-ICS activity value relative to the control is about 80%, 

10 preferably 50%, more preferably 25-1%. Activation of TC-ICS is achieved when the TC- 
ICS activity value relative to the control (untreated with activators) is 1 10%, more 
preferably 1 50%, more preferably 200-500%, more preferably 1000-3000% higher. 

The terms "isolated," "purified" or "biologically pure" refer to material 
that is substantially or essentially free from components which normally accompany it as 

15 found in its native state. Purity and homogeneity are typically determined using 

analytical chemistry techniques such as polyacrylamide gel electrophoresis or high 
performance liquid chromatography. A protein that is the predominant species present in 
a preparation is substantially purified. In particular, an isolated TC-ICS nucleic acid is 
separated from open reading frames that flank the TC-ICS gene and encode proteins other 

20 than TC-ICS. The term "purified" denotes that a nucleic acid or protein gives rise to 

essentially one band in an electrophoretic gel. Particularly, it means that the nucleic acid 
or protein is at least 85% pure, more preferably at least 95% pure, and most preferably at 
least 99% pure. 

"Nucleic acid" refers to deoxyribonucleotides or ribonucleotides and 
25 polymers thereof in either single- or double-stranded form. The term encompasses 
nucleic acids containing known nucleotide analogs or modified backbone residues or 
linkages, which are synthetic, naturally occurring, and non-naturally occurring, which 
have similar binding properties as the reference nucleic acid, and which are metabolized 
in a manner similar to the reference nucleotides. Examples of such analogs include, 
30 without limitation, phosphorothioates, phosphoramidates, methyl phosphonates, chiral- 
methyl phosphonates, 2-O-methyl ribonucleotides, peptide-nucleic acids (PNAs). 

Unless otherwise indicated, a particular nucleic acid sequence also 
implicitly encompasses conservatively modified variants thereof {e.g., degenerate codon 
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substitutions) and complementary sequences, as well as the sequence explicitly indicated. 
The term nucleic acid is used interchangeably with gene, cDNA, mRNA, oligonucleotide, 

and polynucleotide. 

The terms "polypeptide," "peptide" and "protein" are used interchangeably 

5 herein to refer to a polymer of amino acid residues. The terms apply to amino acid 
polymers in which one or more amino acid residue is an analog or mimetic of a 
corresponding naturally occurring amino acid, as well as to naturally occurring amino 
acid polymers. Polypeptides are modified, e.g., by the addition of carbohydrate residues 
to form glycoproteins. The terms "polypeptide," "peptide" and "protein" include 
10 glycoproteins, as well as non-glycoproteins. 

The term "amino acid" refers to naturally occurring and synthetic amino 
acids, as well as amino acid analogs and amino acid mimetics that function in a manner 
similar to the naturally occurring amino acids. Naturally occurring amino acids are those 
encoded by the genetic code, as well as those amino acids that are later modified, e.g., 
15 hydroxyproline, carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to 
compounds that have the same basic chemical structure as a naturally occurring amino 
acid, i.e., an carbon that is bound to ahydrogen, a carboxyl group, an amino group, and 
an R group., e.g., homoserine, norleucine, methionine sulfoxide, methionine methyl 
sulfonium. Such analogs have modified R groups {e.g., norleucine) or modified peptide 
20 backbones, but retain the same basic chemical structure as a naturally occurring ammo 
acid. Amino acid mimetics refers to chemical compounds that have a structure that is 
different from the general chemical structure of an amino acid, but that functions in a 
manner similar to a naturally occurring amino acid. 

Amino acids may be referred to herein by either their commonly known 
25 three letter symbols or by the one-letter symbols recommended by the IUP AC-IUB 

Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to by 
their commonly accepted single-letter codes (A, T, G, C, U, etc.). 

"Conservatively modified variants" applies to both amino acid and nucleic 
acid sequences. With respect to particular nucleic acid sequences, conservatively 
30 modified variants refers to those nucleic acids which encode identical or essentially 

identical amino acid sequences, or where the nucleic acid does not encode an amino acid 
sequence, to essentially identical sequences. Specifically, degenerate codon substitutions 
may be achieved by generating sequences in which the third position of one or more 
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selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues 
(Batzer et al, Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al, J. Biol Chem. 260:2605- 
2608 (1985); Rossolini et ah, Mol. Cell Probes 8:91-98 (1994)). Because of the 
degeneracy of the genetic code, a large number of functionally identical nucleic acids 
5 encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all 

encode the amino acid alanine. Thus, at every position where an alanine is specified by a 
codon in an amino acid herein, the codon can be altered to any of the corresponding 
codons described without altering the encoded polypeptide. Such nucleic acid variations 
are "silent variations," which are one species of conservatively modified variations. 
10 Every nucleic acid sequence herein which encodes a polypeptide also describes every 

possible silent variation of the nucleic acid. One of skill will recognize that each codon in 
a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and TGG, 
which is ordinarily the only codon for tryptophan) can be modified to yield a functionally 
identical molecule. Accordingly, each silent variation of a nucleic acid which encodes a 
15 polypeptide is implicit in each described sequence. 

As to amino acid sequences, one of skill will recognize that individual 
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 
sequence which alters, adds or deletes a single amino acid or a small percentage of amino 
acids in the encoded sequence is a "conservatively modified variant" where the alteration 
20 results in the substitution of an amino acid with a chemically similar amino acid. 

Conservative substitution tables providing functionally similar amino acids are w«U 
known in the art. Such conservatively modified variants are in addition to and do not 
exclude polymorphic variants and alleles of the invention. 

The following groups each contain amino acids that are conservative 
25 substitutions for one another: 



1) 


Alanine (A), Glycine (G); 


2) 


Serine (S), Threonine (T); 


3) 


Aspartic acid (D), Glutamic acid (E); 


4) 


Asparagine (N), Glutamine (Q); 


5) 


Cysteine (C), Methionine (M); 


6) 


Arginine (R), Lysine (K), Histidine (H); 


7) 


Isoleucine (I), Leucine (L), Valine (V); and 


8) 


Phenylalanine (F), Tyrosine (Y), Tryptophan (W). 
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{see, e.g., Creighton, Proteins (1984) for a discussion of amino acid 

properties). 

A "label" or a "detectable moiety" is a composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, or chemical means. For 
example, useful labels include 32 P, 45 Ca, fluorescent groups, molecules or dyes, 
electron-dense reagents, enzymes (e.g., as commonly used in an ELISA), biotin, 
digoxigenin, or haptens and proteins for which specific detectable ligands (such as 
antibodies) exist or can be made (e.g., by incorporating a radiolabel into the ligand). 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, 
either covalently, through a linker or a chemical bond, or noncovalently, through ionic, 
van dcr Waals, electrostatic, or hydrogen bonds to a label such that the presence of the 
probe may be detected by detecting the presence of the label bound to the probe. 

As used herein a "nucleic acid probe or oligonucleotide" is defined as a 
nucleic acid capable of binding to a target nucleic acid of complementary sequence 
through one or more types of chemical bonds, usually through complementary base 
pairing, usually through hydrogen bond formation. As used herein, a probe may include 
natural {i.e., A, G, C, or T) or modified bases (7-deazaguanosine, inosine, etc.). In 
addition, the bases in a probe may be joined by a linkage other than a phosphodiester 
bond, so long as it does not interfere with hybridization. Thus, for example, probes may 
be peptide nucleic acids in which the constituent bases are joined by peptide bonds rather 
than phosphodiester linkages. It will be understood by one of skill in the art that probes 
may bind target sequences lacking complete complementarity with the probe sequence 
depending upon the stringency of the hybridization conditions. The probes are preferably 
directly labeled as with isotopes, chromophores, lumiphores, chromogens, or indirectly 
labeled such as with biotin to which a streptavidin complex may later bind. By assaying 
for the presence or absence of the probe, one can detect the presence or absence of the 
select sequence or subsequence. 

The term "recombinant" when used with reference, e.g., to a cell, or 
nucleic acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has 
been modified by the introduction of a heterologous nucleic acid or protein or the 
alteration of a native nucleic acid or protein, or that the cell is derived from a cell so 
modified. Thus, for example, recombinant cells express genes that are not found within 



WO 02/054069 PCT/U SO 1/49808 

16 

the native (non-recombinant) form of the cell or express native genes that are otherwise 
abnormally expressed, under expressed or not expressed. at all. 

The term "heterologous" when used with reference to portions of a nucleic 
acid indicates that the nucleic acid comprises two or more subsequences that are not 
5 found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences from unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a 
coding region from another source. Similarly, a heterologous protein indicates that the 
protein comprises two or more subsequences that are not found in the same relationship to 
10 each other in nature (e.g., a fusion protein). 

A "promoter" is defined as an array of nucleic acid control sequences that 
direct transcription of a nucleic acid. As used herein, a promoter includes necessary 
nucleic acid sequences near the start site of transcription, such as in the case of a 
polymerase II type promoter, a TATA element. A promoter also optionally includes 
15 distal enhancer or repressor elements, which can be located as much as several thousand 
base pairs from the start site of transcription. A "constitutive" promoter is a promoter that 
is active under most environmental and developmental conditions. An "inducible" 
promoter is a promoter that is active under environmental or developmental regulation. 
The term "operably linked" refers to a functional linkage between a nucleic acid 
20 expression control sequence (such as a promoter, or array of transcription factor binding 
sites) and a second nucleic acid sequence, wherein the expression control sequence 
directs transcription of the nucleic acid corresponding to the second sequence. 

An "expression vector" is a nucleic acid construct, generated 
recombinantly or synthetically, with a series of specified nucleic acid elements that 
25 permit transcription of a particular nucleic acid in a host cell. The expression vector can 
be part of a plasmid, virus, or nucleic acid fragment. Typically, the expression vector 
includes a nucleic acid to be transcribed operably linked to a promoter. 

The terms "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences 
30 that are the same or have a specified percentage of amino acid residues or nucleotides that 
are the same, when compared and aligned for maximum correspondence over a 
comparison window, as measured using one of the following sequence comparison 
algorithms or by manual alignment and visual inspection. Such sequences are then said 
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to be "substantially identical." This definition also refers to the complement of a test 
sequence. Preferably, the percent identity exists over a region of the sequence that is at 
least about 25 amino acids in length, more preferably over a region that is 50 or 100 

amino acids in length. 

For sequence comparison, one sequence acts as a reference sequence, to 
which test sequences are compared. When using a sequence comparison algorithm, test 
and reference sequences are entered into a computer, subsequence coordinates are 
designated, if necessary, and sequence algorithm program parameters are designated. 
Default program parameters can be used, or alternative parameters can be designated. 
The sequence comparison algorithm then calculates the percent sequence identities for the 
test sequences relative to the reference sequence, based on the program parameters. 

A "comparison window," as used herein, includes reference to a segment 
of contiguous positions selected from the group consisting of from 20 to 600, usually 
about 50 to about 200, more usually about 100 to about 150 in which a sequence may be 
compared to a reference sequence of the same number of contiguous positions after the 
two sequences are optimally aligned. Methods of alignment of sequences for comparison 
are well-known in the art. Optimal alignment of sequences for comparison can be 
conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl. 
Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. 
Mol. Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. 
Nat'l. Acad. Sri. USA 85:2444 (1988), by computerized implementations of these 
algorithms (GAP, BESTFTT, FASTA, and TFASTA in the Wisconsin Genetics Software 
Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual 

alignment and visual inspection. 

One example of a useful algorithm is PILEUP. PILEUP creates a multiple 
. sequence alignment from a group of related sequences using progressive, pairwise 
alignments to show relationship and percent sequence identity. It also plots a tree or 
dendogram showing the clustering relationships used to create the alignment. PILEUP 
uses a simplification of the progressive alignment method of Feng & Doolittle, /. Mol. 
) Evol. 35:35 1 -360 (1987). The method used is similar to the method described by Higgins 
& Sharp, CABIOS 5:151-153 (1989). The program can align up to 300 sequences, each 
of a maximum length of 5,000 nucleotides or amino acids. The multiple ahgnment 
procedure begins with the pairwise alignment of the two most similar sequences, 
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producing a cluster of two aligned sequences. This cluster is then aligned to the next 
most related sequence or cluster of aligned sequences. Two clusters of sequences are 
aligned by a simple extension of the pairwise alignment of two individual sequences. The 
final alignment is achieved by a series of progressive, pairwise alignments. The program 

5 is run by designating specific sequences and their amino acid or nucleotide coordinates 
for regions of sequence comparison and by designating the program parameters. Using 
PILEUP, a reference sequence is compared to other test sequences to determine the 
percent sequence identity relationship using the following parameters: default gap weight 
(3.00), default gap length weight (0.10), and weighted end gaps. PILEUP can be obtained 

10 from the GCG sequence analysis software package, e.g., version 7.0 (Devereaux et al, 

Nuc. Acids Res. 12:387-395 (1984)). 

Another example of algorithm that is suitable for determining percent 
sequence identity and sequence similarity is the BLAST algorithm, which is described in 
Altschul et al.,J. Mol. Biol 215:403-410 (1990). Software for performing BLAST 
15 analyses is publicly available through the National Center for Biotechnology Information 
(http://wvm.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring 
sequence pairs (HSPs) by identifying short words of length W in the query sequence, 
which either match or satisfy some positive-valued threshold score T when aligned with a 
word of the same length in a database sequence. T is referred to as the neighborhood 
20 word score threshold (Altschul et al, supra). These initial neighborhood word hits act as 
seeds for initiating searches to find longerHSPs containing them. The word hits are 
extended in both directions along each sequence for as far as the cumulative alignment 
score can be increased. Extension of the word hits in each direction are halted when: the 
cumulative alignment score falls off by the quantity X from its maximum achieved value; 
25 the cumulative score goes to zero or below, due to the accumulation of one or more 
negative-scoring residue alignments; or the end of either sequence is reached. The 
BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the 
alignment. The BLAST program uses as defaults a word length (W) of 1 1, the 
BLOSUM62 scoring matrix {see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 
30 89: 1091 5 (1989)) alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a 

comparison of both strands. 

The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences {see, e.g., Karlin & Altschul, Proc. Nat'!. Acad. Sci. USA 
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90:5873-5787(1993)). One measure of similarity provided by the BLAST algorithm is 
the smallest sum probability (P(N)), which provides an indication of the probability by 
which a match between two nucleotide or amino acid sequences would occur by chance. 
For example, a nucleic acid is considered similar to a reference sequence if the smallest 
5 sum probability in a comparison of the test nucleic acid to the reference nucleic acid is 
less than about 0.2, more preferably less than about 0.01, and most preferably less than 
about 0.001. 

An indication that two nucleic acid sequences or polypeptides are 
substantially identical is that the polypeptide encoded by the first nucleic acid is 
10 immunologically cross reactive with the antibodies raised against the polypeptide 

encoded by the second nucleic acid, as described below. Thus, a polypeptide is typically 
substantially identical to a second polypeptide, for example, where the two peptides differ 
only by conservative substitutions. Another indication that two nucleic acid sequences 
are substantially identical is that the two molecules or their complements hybridize to 
1 5 each other under stringent conditions, as described below. 

The phrase "selectively (or specifically) hybridizes to" refers to the 
binding, duplexing, or hybridizing of a molecule only to a particular nucleotide sequence 
under stringent hybridization conditions when that sequence is present in a complex 
mixture (e.g., total cellular or library DNA or RNA). 
20 The phrase "stringent hybridization conditions" refers to conditions under 

which a probe will hybridize to its target subsequence, typically in a complex mixture of 
nucleic acid, but to no other sequences. Stringent conditions are sequence-dependent and 
will be different in different circumstances. Longer sequences hybridize specifically at 
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in 
25 Tijssen, Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic 
Probes, "Overview of principles of hybridization and the strategy of nucleic acid assays" 
(1993). Generally, stringent conditions are selected to be about 5-10° C lower than the 
thermal melting point (T m ) for the specific sequence at a defined ionic strength pH. The 
T m is the temperature (under defined ionic strength, pH, and nucleic concentration) at 
30 which 50% of the probes complementary to the target hybridize to the target sequence at 
equilibrium (as the target sequences are present in excess, at T m , 50% of the probes are 
occupied at equilibrium). Stringent conditions will be those in which the salt 
concentration is less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium 
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ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° 
C for short probes {e.g., 10 to 50 nucleotides) and at least about 60° C for long probes 
(e.g. , greater than 5 0 nucleotides) . Stringent conditions may also be achieved with the 
addition of destabilizing agents such as formamide. For selective or specific 
5 hybridization, a positive signal is at least two times background, preferably 10 times 
background hybridization. Exemplary stringent hybridization conditions can be as 
following: 50% formamide, 5x SSC, and 1% SDS, incubating at 42° C, or, 5x SSC, 1% 
SDS, incubating at 65° C, with wash in 0.2x SSC, and 0.1% SDS at 65° C. 

Nucleic acids that do not hybridize to each other under stringent conditions 
10 are still substantially identical if the polypeptides which they encode are substantially 
identical. This occurs, for example, when a copy of a nucleic acid is created using the 
maximum codon degeneracy permitted by the genetic code. In such cases, the nucleic 
acids typically hybridize under moderately stringent hybridization conditions. Exemplary 
"moderately stringent hybridization conditions" include a hybridization in a buffer of 
1 5 40% formamide, 1 M NaCl, 1 % SDS at 37° C, and a wash in IX SSC at 45° C. A positive 
hybridization is at least twice background. Those of ordinary skill will readily recognize 
that alternative hybridization and wash conditions can be utilized to provide conditions of 

similar stringency. 

A further indication that two polynucleotides are substantially identical is 

20 if the reference sequence, amplified by a pair of oligonucleotide primers, can then be used 
as a probe under stringent hybridization conditions to isolate the test sequence from a 
cDNA or genomic library, or to identify the test sequence in, e.g., a northern or Southern 
blot. Alternatively, another indication that the sequences are substantially identical is if 
the same set of PCR primers can be used to amplify both sequences. 

25 "Antibody" refers to a polypeptide encoded by an immunoglobulin gene or 

fragments thereof that specifically binds and recognizes an antigen. The recognized 
immunoglobulin genes include" the kappa, lambda, alpha, gamma, delta, epsilon, and mu 
constant region genes, as well as the myriad immunoglobulin variable region genes. 
Light chains are classified as either kappa or lambda. Heavy chains are classified as 

30 gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, 
IgG, IgM, IgA, IgD and IgE, respectively. 

An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each 
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pair having one "light" (about 25 kDa) and one "heavy" chain (about 50-70 kDa). The 
N-tenninus of each chain defines a variable region of about 100 to 110 or more amino 
acids primarily responsible for antigen recognition. The terms variable light chain (V L ) 
and variable heavy chain (V H ) refer to these light and heavy chains respectively. 
5 Antibodies exist, e.g., as intact immunoglobulins or as a number of well 

characterized fragments produced by digestion with various peptidases. Thus, for 
example, pepsin digests an antibody below the disulfide linkages in the hinge region to 
produce F(ab)' 2l a dimer of Fab which itself is a light chain joined to V H -C H 1 by a 
disulfide bond. The F(ab)' 2 may be reduced under mild conditions to break the disulfide 
10 linkage in the hinge region, thereby converting the F(ab)' 2 dimer into an Fab' monomer. 
The Fab' monomer is essentially an Fab with part of the hinge region (see Fundamental 
Immunology (Paul ed., 3d ed. 1993)). While various antibody fragments are defined in 
terms of the digestion of an intact antibody, one of skill will appreciate that such 
fragments may be synthesized de novo either chemically or by using recombinant DNA 
15 memodology. Thus, the term antibody, as used herein, also includes antibody fragments 
either produced by the modification of whole antibodies or those synthesized de novo 
using recombinant DNA methodologies (e.g. , single chain Fv). 

A "chimeric antibody" is an antibody molecule in which (a) the constant 
region, or a portion thereof, is altered, replaced or exchanged so that the antigen binding 
20 site (variable region) is linked to a constant region of a different or altered class, effector 
function and/or species, or an entirely different molecule which confers new properties to 
the chimeric antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) 
the variable region, or a portion thereof, is altered, replaced or exchanged with a variable 
region having a different or altered antigen specificity. 
25 For preparation of monoclonal or polyclonal antibodies, any technique 

known in the art can be used (see, e.g., Kohler & Milstein, Nature 256:495-497 (1975); 
Kozbor et al, Immunology Today 4:72 (1983); Cole et al, pp. 77-96 in Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, Inc. (1985)). Techniques for the 
production of single chain antibodies (U.S. Patent 4,946,778) can be adapted to produce 
30 antibodies to polypeptides of this invention. Also, transgenic mice, or other organisms 
such as other mammals, may be used to express humanized antibodies. Alternatively, 
phage display technology can be used to identify antibodies and heteromeric Fab 
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fragments that specifically bind to selected antigens {see, e.g., McCafferty et al, Nature 
348, 552-554 (1990); Marks et al, Biotechnology 10, 779-783 (1992)). 

An "TC-ICS" antibody is an antibody or antibody fragment that 
specifically binds a polypeptide encoded by the TC-ICS gene, cDNA, or a subsequence 
5 thereof. 

The term "immunoassay" is an assay that uses an antibody to specifically 
bind an antigen. The immunoassay is characterized by the use of specific binding 
properties of a particular antibody to isolate, target, and/or quantify the antigen. 

The phrase "specifically (or selectively) binds" to an antibody or 
10 "specifically (or selectively) immunoreactive with," when referring to a protein or 

peptide, refers to a binding reaction that is determinative of the presence of the protein in 
a heterogeneous population of proteins and other biologies. Thus, under designated 
immunoassay conditions, the specified antibodies bind to a particular protein at least two 
times the background and do not substantially bind in a significant amount to other 
15 proteins present in the sample. Specific binding to an antibody under such conditions 
may require an antibody that is selected for its specificity for a particular protein. For 
example, polyclonal antibodies raised to TC-ICS from specific species such as rat, mouse, 
or human can be selected to obtain only those polyclonal antibodies that are specifically 
immunoreactive with TC-ICS and not with other proteins, except for polymorphic 
20 variants and alleles of TC-ICS. This selection may be achieved by subtracting out 
antibodies that cross-react with TC-ICS molecules from other species. A variety of 
immunoassay formats may be used to select antibodies specifically immunoreactive with 
a particular protein. For example, solid-phase ELISA immunoassays are routinely used to 
select antibodies specifically immunoreactive with a protein (see. e.g., Harlow & Lane, 
25 Antibodies. A Laboratory Manual (1988), for a description of immunoassay formats and 
conditions that can be used to determine specific immunoreactivity). Typically a specific 
or selective reaction will be at least twice background signal or noise and more typically 
more than 10 to 100 times background. 

The phrase "selectively associates with" refers to the ability of a nucleic 
30 acid to "selectively hybridize" with another as defined above, or the ability of an antibody 
to "selectively (or specifically) bind" to a protein, as defined above. 



in. Assays for taste modulation 
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A. Assays for taste cell-specific ion channel subunit activity 
TC-ICS and its alleles, interspecies homologs, and polymorphic variants 
participate in taste transduction. The activity of TC-ICS polypeptides can be assessed 
using a variety of in vitro and in vivo assays, e.g., measuring second messenger (e.g., 
5 cAMP, cGMP, IP 3 , DAG, or Ca 2+ ), ion flux, phosphorylation levels, transcription levels, 
neurotransmitter levels, and the like. Furthermore, such assays can be used to screen for 
activators, inhibitors, and modulators of TC-ICS. Such activators, inhibitors, and 
modulators of taste transduction activity are useful for customizing taste. 

Biologically active TC-ICS polypeptides, either recombinant or naturally 
1 0 occurring, are used to screen activators, inhibitors, or modulators of taste. The TC-ICS 
polypeptides are isolated, e.g., expressed in a cell, expressed in a membrane derived from 
a cell, expressed in tissue or in an animal, either recombinant or naturally occurring. For 
example, tongue slices, dissociated cells from a tongue, transformed cells, or membranes 
are used. Taste transduction can also be examined in vitro with soluble or solid state 
15 reactions. Preferably, TC-ICS of the assay will be selected from a polypeptide having a 
sequence of SEQ ID NO:2, SEQ ID NO:5, or SEQ ID NO:8, or conservatively modified 
variant thereof. Alternatively, TC-ICS of the assay will be derived from a eukaryote and 
includes an amino acid subsequence having amino acid sequence identity to SEQ ID 
NO:2, SEQ ID NO:5, or SEQ ID NO:8. Generally, the amino acid sequence identity will 
20 be at least 70%, preferably at least 85%, most preferably at least 90-95%. 

Samples or assays that are treated with a test compound which potentially 
activates, inhibits, or modulates TC-ICS are compared to control samples that are not 
treated without the test compound, to examine the extent of modulation. Control samples 
(untreated with activators, inhibitors, or modulators) are assigned a relative TC-ICS 
25 activity value of 100%. Inhibition of TC-ICS is achieved when the TC-ICS activity value 
relative to the control is about 90% (e.g., 10% less than the control), preferably 50%, 
more preferably 25-5%, most preferably 5-0%. Activation of TC-ICS is achieved when 
the TC-ICS activity value relative to the control is 1 10% (e.g. , 1 0% more than the 
control), more preferably 150%, more preferably 200-500%, more preferably 1000- 
30 2000%, or more than 2000% (e.g. , 1 0,000%). 

In one embodiment, the activity of TC-ICS polypeptides is assessed by 
measuring, e.g., changes in intracellular second messengers, such as cAMP, cGMP, IP 3 , 
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DAG, or Ca 2+ . Therefore, the second messenger levels are used as reporters for potential 
activators, inhibitors, and modulators of TC-ICS polypeptides. 

Ion channel modulation typically initiates or inhibits subsequent 
intracellular events via, e.g., G-proteins and/or other enzymes, such as adenylate cyclase 
5 or phospholipase C, which are downstream from the ion channel-mediated events in taste 
transduction pathways. For example, ion channel activation may result in a change in the 
level of intracellular cyclic nucleotides, e.g., cAMP or cGMP, by activating or inhibiting 
enzymes such as adenylate cyclase by G-protein a and |fy subunits. These intracellular 
cyclic nucleotides, in turn, may modulate other molecules, such as, cyclic nucleotide- 
10 gated ion channels, e.g. , channels that are made permeable to cations by binding of cAMP 
or cGMP {see, e.g., Altenhofen et al, Proc. Natl. Acad. Sci. U.S.A. 88:9868-9872 (1991) 
and Dhallan et al, Nature 347:184-187 (1990)). Cells for this type of assay are made by 
co-transfection of a host cell with any one or a combination of DNA encoding a cyclic 
nucleotide-gated ion channel, GPCR phosphatase, DNA encoding TC-ICS, and DNA 
1 5 encoding a G-protein coupled receptor. The receptor may be, e.g. , metabotropic 

glutamate receptors, muscarinic acetylcholine receptors, dopamine receptors, serotonin 
receptors, and the like, which, when activated, causes a change in cyclic nucleotide levels 
in the cytoplasm. 

In response to external stimuli, certain ion channels may activate other 
20 effectors, such as phospholipase C, through G-proteins, GPCRs, modulating enzyme 

activities, or other ion channels. Activation of phospholipase C results in the production 
of inositol 1, 4, 5-triphosphate (IP3) and diacylglycerol (DAG) from inositol 4,5- 
biphosphate (PIP2) (Berridge & Irvine, Nature 312:315-21 (1984)). IP 3 in turn stimulates 
the release of intracellular calcium ion stores. Cells may exhibit increased cytoplasmic 
25 calcium levels as a result of contribution from both intracellular stores and via activation 
of ion channels, in which case it may be desirable although not necessary to conduct such 
assays in calcium-free buffer, optionally supplemented with a chelating agent such as 
EGTA, to distinguish fluorescence response resulting from calcium release from internal 
stores. Thus, a change in the level of second messengers, such as IP 3 , DAG, or Ca can 
30 be used to assess TC-ICS function. Furthermore, a change in the level of these second 
messengers are used to screen for activators, inhibitors, and modulators of TC-ICS 
polypeptides. 
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In one embodiment, the changes in intracellular cAMP or cGMP are 
measured using immunoassays. The method described in Offermanns & Simon, J. Biol. 
Chem. 270:15175-15180 (1995) may be used to determine the level of cAMP. Also, the 
method described in Felley-Bosco etal.,Am. J. Resp. Cell andMol. Biol. 11:159-164 
5 (1994) may be used to determine the level of cGMP. Further, an assay kit for measuring 
cAMP and/or cGMP is described in U.S. Patent 4,1 15,538, herein incorporated by 
reference. 

In another embodiment, phosphatidyl inositol (PI) hydrolysis are analyzed 
according to U.S. Patent 5,436,128, herein incorporated by reference. Briefly, the assay 

10 involves labeling of cells with 3 H-myoinositol for 48 or more hrs. The labeled cells are 
treated with a test compound for one hour. The treated cells are lysed and extracted in 
chloroform-methanol-water after which the inositol phosphates were separated by ion 
exchange chromatography and quantified by scintillation counting. Fold stimulation is 
determined by calculating the ratio of cpm in the presence of agonist to cpm in the 

1 5 presence of buffer control. Likewise, fold inhibition is determined by calculating the 

ratio of cpm in the presence of antagonist to cpm in the presence of buffer control (which 
may or may not contain an agonist). 

In another embodiment, intracellular Ca 2+ levels are analyzed, e.g., using 
fluorescent Ca 2+ indicator dyes and fluorometric imaging {see, e.g., Hall et al., Nature 

20 331:729 (1988); Kudo et al, Neuros. 50:619-625 (1992); van Heugten et al, J. Mol Cell 

Cardiol. 26:1081-93 (1994)). 

In another embodiment, the activity of TC-ICS can also be assessed by 
measuring changes in ion flux. Changes in ion flux may be measured by determining 
changes in polarization (i.e., electrical potential) of the cell or membrane expressing TC- 

25 ICS. One means to determine changes in cellular polarization is by measuring changes in 
current (thereby measuring changes in polarization) with voltage-clamp and patch-clamp 
techniques, e.g., the "cell-attached" mode, the "inside-out" mode, and the "whole cell" 
mode (see, e.g., Ackerman et al, New Engl J. Med. 336:1575-1595 (1997)). Whole cell 
currents are conveniently determined using the standard methodology (see, e.g., Hamil et 

30 al, PFlugers. Archiv. 391:85 (1981)). Other known assays include: radiolabeled ion flux 
assays and fluorescence assays using voltage-sensitive dyes (see, e.g., Vestergarrd- 
Bogind et al, J. Membrane Biol. 88:67-75 (1988); Gonzales & Tsien, Chem. Biol. 4:269- 
277 (1997); Daniel et al, J. Pharmacol. Meth. 25:185-193 (1991); Holevinsky et al, J. 
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Membrane Biology 137:59-70 (1994)). A method for the whole-cell recording from non- 
dissociated taste cells within mouse taste bud is described in Miyamoto et al, J. Neurosci 
Methods 64:245-252 (1996). Therefore, changes in ion flux are used to screen for 
activators, inhibitors, and modulators of TC-ICS. Generally, the compounds to be tested 
5 are present in the range from 1 pM to 100 mM. 

Assays for measuring changes in ion flux include cells that are loaded with 
ion or voltage sensitive dyes to report TC-ICS activity. Assays for detennining activity 
of these polypeptides can also use known agonists and antagonists for these polypeptides 
as negative or positive controls to assess activity of tested compounds. In assays for 
1 0 identifying modulatory compounds {e.g. , agonists, antagonists), changes in the level of 
ions in the cytoplasm or membrane voltage will be monitored using an ion sensitive or 
membrane voltage fluorescent indicator, respectively. Among the ion-sensitive indicators 
and voltage probes that may be employed are those disclosed in the Molecular Probes 
1997 Catalog. 

1 5 In another embodiment, phosphorylation of taste cell specific proteins are 

measured to assess the effects of a test compound on TC-ICS function. This can be 
achieved by using a method disclosed in, e.g., U.S. Patent 5,834,216, herein incorporated 
by reference. A duplicate cell culture containing expressed TC-ICS is prepared. One of 
the duplicate cultures is exposed to a test compound. Cell lysates from the duplicate 
20 cultures are prepared. The cell lysates are contacted with ATP wherein the ATP has a 
gamma-phosphate having a detectable label, or an analog of a gamma phosphate (i.e., 
having a label capable of being transferred to a phosphorylation site such as gamma S 35 ). 
The level of phosphorylated taste cell specific proteins may be measured by precipitating 
the cell lysates with an antibody specific for taste cell specific proteins. After 
25 precipitation, phosphorylated (labeled) taste cell specific proteins may be separated from 
other cellular proteins by electrophoresis or by chromatographic methods. By way of 
example, labeled taste cell specific proteins may be separated on denaturing 
polyacrylamide gels after which the separated proteins may be transferred to, for 
example, a nylon or nitrocellulose membrane followed by exposure to X-ray film. 
30 Relative levels of phosphorylation are then determined after developing the exposed X- 
ray film and quantifying the density of bands corresponding to the taste cell specific 
proteins, for example, densitometry. The autoradiograph may also be used to localize the 
bands on the membrane corresponding to labeled taste cell specific proteins after which 
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they may be excised from the membrane and counted by liquid scintillation or other 
counting methods. Using this method, a test compound which effects the function of TC- 
ICS is identified by its ability to increase or decrease phosphorylation of taste cell specific 
proteins compared to control cells not exposed to the test compound. 
5 In another embodiment, transcription levels are measured to assess the 

effects of a test compound on TC-ICS function. A host cell containing TC-ICS is 
contacted with a test compound for a sufficient time to effect any interactions, and then 
the level of TC-ICS gene expression is measured. The amount of time to effect such 
interactions may be empirically determined, such as by running a time course and 
10 measuring the level of transcription as a function of time. The amount of transcription 
may be measured by using any method known to those of skill in the art to be suitable. 
For example, mRNA expression of TC-ICS may be detected using northern blots or their 
polypeptide products may be identified using immunoassays. Alternatively, transcription 
based assays using reporter gene may be used as described in U.S. Patent 5,436,128, 
15 herein incorporated by reference. The reporter genes can be, e.g., chloramphenicol 
acetyltransferase, firefly luciferase, bacterial luciferase, P-galactosidase and alkaline 
phosphatase. Furthermore, TC-ICS can be used as indirect reporters via attachment to a 
second reporter such as green fluorescent protein {see, e.g., Mistili & Spector, Nature 
Biotechnology 15:961-964 (1997)). 
20 The amount of transcription is then compared to the amount of 

transcription in either the same cell in the absence of the test compound, or it may be 
compared with the amount of transcription in a substantially identical cell that lacks TC- 
ICS. A substantially identical cell may be derived from the same cells from which the 
recombinant cell was prepared but which had not been modified by introduction of 
25 heterologous DNA. Any difference in the amount of transcription indicates that the test 
compound has in some manner altered the activity of TC-ICS. 

Other physiological change that affects TC-ICS activity are used to assess 
the influence of a test compound on the polypeptides of this invention. When the 
functional consequences are determined using intact cells or animals, one can also 
30 measure a variety of effects such as transmitter release, hormone release, transcriptional 
changes to both known and uncharacterized genetic markers {e.g., northern blots), 
changes in cell metabolism such as cell growth or pH changes, and the like. 
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In one preferred embodiment, TC-ICS activity is measured by expressing 
TC-ICS in a heterologous cell with a taste cell specific G-protein receptor (TC-GPCR; 
see U.S.S.N. 60/094,465 filed July 28, 1998; U.S.S.N. 60/095,464 filed July 28, 1998; 
U.S.S.N. 60/1 12,747 filed December 17, 1998) and a promiscuous G-protein that links 

5 the receptor to a phospholipase C signal transduction pathway {see Offermanns & Simon, 
J. Biol Chem. 270:15175-15180 (1995); see also Example II). A TC-GPCR, such as 
GPCR-B3 or GPCR-B4, can be used in the assays (see U.S.S.N. 60/094,465 filed July 28, 
1998 for the description of GPCR-B3 and U.S.S.N. 60/095,464 filed July 28, 1998 and 
60/112,747 filed December 17, 1998 for the description of GPCR-B4). Gal4orGal5 

10 can be used as a promiscuous G-protein alpha subunit (Wilkie et al, PNAS USA 

88:10049-10053 (1991)). Such promiscuous G-proteins allow coupling of a wide range 
of receptors. Alternatively, a taste cell specific G-protein alpha subunit can be used, such 
as the Got subunit described in copending application U.S.S.N. 60/1 17,367, TTC ref. no. 
02307E-092600, filed 1/27/99, and U.S.S.N. 60/117,404, TTC ref. no. 02307E-092700, 

15 filed 1/27/99, herein incorporated by reference. Preferably the cell line is HEK-293 
(which does not naturally express GPCR-B4) and the promiscuous G-protein is Gal 5 
(Offermanns & Simon, supra). Modulation of taste transduction is assayed by measuring 
changes in intracellular Ca 2+ levels. Changes in Ca 2+ levels are preferably measured 
using fluorescent Ca 2+ indicator dyes and fluorometric imaging. 

20 

B. Modulators 

The compounds tested as modulators of TC-ICS can be an ion, any small 
chemical compound, or a biological entity, such as a protein (e.g., a GPCR or a GPCR 
binding protein), sugar, nucleic acid or lipid. Alternatively, modulators can be genetically 

25 altered versions of TC-ICS. Typically, test compounds will be small chemical molecules 
and peptides. Essentially any chemical compound can be used as a potential modulator or 
ligand in the assays of the invention, although most often compounds that can be 
dissolved in aqueous or organic (especially DMSO-based) solutions are used. The assays 
are designed to screen large chemical libraries by automating the assay steps and 

30 providing compounds from any convenient source to assays, which are typically run in 
parallel (e.g. , in microtiter formats on microliter plates in robotic assays). It will be 
appreciated that there are many suppliers of chemical compounds, including Sigma (St. 
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Louis, MO), Aldrich (St. Louis, MO), Sigma-Aldrich (St. Louis, MO), Fluka Chemika- 
Biochemica Analytika (Buchs Switzerland) and the like. 

In one preferred embodiment, high throughput screening methods involve 
providing a combinatorial chemical or peptide library containing a large number of 
5 potential therapeutic compounds (potential modulator or ligand compounds). Such 

"combinatorial chemical libraries" or "ligand libraries" are then screened in one or more 
assays, as described herein, to identify those library members (particular chemical species 
or subclasses) that display a desired characteristic activity. The compounds thus 
identified can serve as conventional "lead compounds" or can themselves be used as 
10 potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical 
compounds generated by either chemical synthesis or biological synthesis, by combining 
a number of chemical "building blocks" such as reagents. For example, a linear 
combinatorial chemical library such as a polypeptide library is formed by combining a set 
15 of chemical building blocks (amino acids) in every possible way for a given compound 

length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical 
compounds can be synthesized through such combinatorial mixing of chemical building 
blocks. 

Preparation and screening of combinatorial chemical libraries is well 
20 known to those of skill in the art. Such combinatorial chemical libraries include, but are 
not limited to, peptide libraries (see, e.g,, U.S. Patent 5,010,175, Furka, Int. J. Pept. Prot. 
Res. 37:487-493 (1991) and Houghton et al, Nature 354:84-88 (1991)). Other 
chemistries for generating chemical diversity libraries can also be used. Such chemistries 
include, but are not limited to: peptoids (e.g., PCT Publication No. WO 91/19735), 

25 encoded peptides (e.g., PCT Publication No. WO 93/20242), random bio-oligomers (e.g., 
PCT Publication No. WO 92/00091), benzodiazepines (e.g., U.S. Pat. No. 5,288,514), 
diversomers such as hydantoins, benzodiazepines and dipeptides (Hobbs et al.,Proc. Nat. 
Acad. Sci. USA 90:6909-6913 (1993)), vinylogous polypeptides (Hagihara et al, J. Amer. 
Chem. Soc. 1 14:6568 (1992)), nonpeptidal peptidomimetics with glucose scaffolding 

30 (Hirschmann et al, J. Amer. Chem. Soc. 114:9217-9218 (1992)), analogous organic 
syntheses of small compound libraries (Chen et al, J. Amer. Chem. Soc. 1 16:2661 
(1994)), oligocarbamates (Cho et al, Science 261:1303 (1993)), and/or peptidyl 
phosphonates (Campbell et al, J. Org. Chem. 59:658 (1994)), nucleic acid libraries (see 
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Ausubel, Berger and Sambrook, all supra), peptide nucleic acid libraries {see, e.g., U.S. 
Patent 5,539,083), antibody libraries (see, e.g., Vaughn et al, Nature Biotechnology, 
14(3):309-314 (1996) and PCT/US96/1 0287), carbohydrate libraries (see, e.g., Liang et 
al, Science, 274:1520-1522 (1996) and U.S. Patent 5,593,853), small organic molecule 
5 libraries (see, e.g., benzodiazepines, Baum C&EN, Jan 18, page 33 (1993); isoprenoids, 
U.S. Patent 5,569,588; thiazolidinones and metathiazanones, U.S. Patent 5,549,974; 
pyrrolidines, U.S. Patents 5,525,735 and 5,519,134; morpholino compounds, U.S. Patent 
5,506,337; benzodiazepines, 5,288,514, and the like). 

Devices for the preparation of combinatorial libraries are commercially 
10 available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, 

Symphony, Rainin, Wobum, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, 
Millipore, Bedford, MA). In addition, numerous combinatorial libraries are themselves 
commercially available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, 
Tripos, Inc., St. Louis, MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, 
15 PA, Martek Biosciences, Columbia, MD, etc.). 

In one embodiment, the invention provides solid phase based in vitro 
assays in a high throughput format, where the cell or tissue expressing TC-ICS is attached 
to a solid phase substrate. In the high throughput assays of the invention, it is possible to 
screen up to several thousand different modulators or ligands in a single day. In 
20 particular, each well of a microtiter plate is used to run a separate assay against a selected 
potential modulator, or, if concentration or incubation time effects is to be observed, 
every 5-10 wells can test a single modulator. Thus, a single standard microtiter plate can 
assay about 100 (e.g., 96) modulators. If 1536 well plates are used, then a single plate 
can easily assay from about 100- about 1500 different compounds. It is possible to assay 
25 several different plates per day; assay screens for up to about 6,000-20,000 different 
compounds is possible using the integrated systems of the invention. More recently, 
microfluidic approaches to reagent manipulation have been developed, e.g., by Caliper 
Technologies (Palo Alto, CA). 

30 C. Computer-based assays 

Yet another assay for compounds that modulate TC-ICS activity involves 
computer assisted drug design, in which a computer system is used to generate a three- 
dimensional structure of TC-ICS based on the structural information encoded by the 
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amino acid sequence. The input amino acid sequence interacts directly and actively with 
a pre-established algorithm in a computer program to yield secondary, tertiary, and 
quaternary structural models of the protein. The models of the protein structure are then 
examined to identify regions of the structure that have the ability to bind, e.g., ligands. 
5 These regions are then used to identify ligands that bind to the protein. 

The three-dimensional structural model of the protein is generated by 
entering amino acid sequences of at least 10 amino acid residues that are present in IC- 
ICS (for example, SEQ ID NO:2, SEQ ID NO:5, or SEQ ID NO:8 or conservatively 
modified versions thereof) or corresponding nucleic acid sequences encoding a TC-ICS 

10 polypeptide (for example, SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6 or 
SEQ ID NO: 7 or conservatively modified versions thereof) into a computer system.. The 
amino acid sequence represents the primary sequence or subsequence of the protein, 
which encodes the structural information of the protein. At least 10 residues of the amino 
acid sequence (or a nucleotide sequence encoding 10 amino acids) are entered into the 

15 computer system from computer keyboards, computer readable substrates that include, 

but are not limited to, electronic storage media (e.g., magnetic diskettes, tapes, cartridges, 
and chips), optical media (e.g., CD ROM), information distributed by internet sites, and 
by RAM. The three-dimensional structural model of the protein is then generated by the 
interaction of the amino acid sequence and the computer system, using software known to 

20 those of skill in the art. The three-dimensional structural model of the protein is saved to 
a computer readable form and be used for further analysis (e.g., identifying potential 
ligand binding regions of the protein and screening for mutations, alleles and interspecies 
homologs of the gene). 

The amino acid sequence represents a primary structure that encodes the 

25 information necessary to form the secondary, tertiary and quaternary structure of the 
protein of interest. The software looks at certain parameters encoded by the primary 
sequence to generate the structural model. These parameters are referred to as "energy 
terms," and primarily include electrostatic potentials, hydrophobic potentials, solvent 
accessible surfaces, and hydrogen bonding. Secondary energy terms include van der 

30 Waals potentials. Biological molecules form the structures that minimize the energy 
terms in a cumulative fashion. The computer program is therefore using these terms 
encoded by the primary structure or amino acid sequence to create the secondary 
structural model. 
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The tertiary structure of the protein encoded by the secondary structure is 
then formed on the basis of the energy terms of the secondary structure. The user at this 
point can enter additional variables such as whether the protein is membrane bound or 
soluble, its location in the body, and its cellular location, e.g., cytoplasmic, surface, or 

5 nuclear. These variables along with the energy terms of the secondary structure are used 
to form the model of the tertiary structure. In modeling the tertiary structure, the 
computer program matches hydrophobic faces of secondary structure with like, and 
hydrophilic faces of secondary structure with like. 

Once the structure has been generated, potential ligand binding regions are 

1 0 identified by the computer system. Three-dimensional structures for potential ligands are 
generated by entering amino acid or nucleotide sequences or chemical formulas of 
compounds, as described above. The three-dimensional structure of the potential ligand 
is then compared to that of the TC-ICS protein to identify ligands that bind to TC-ICS. 
Binding affinity between the protein and ligands is determined using energy terms to 

1 5 determine which ligands have an enhanced probability of binding to the protein. The 
results, such as three-dimensional structures for potential ligands and binding affinity of 
ligands, can also be saved to a computer readable form and is used for further analysis 
{e.g., generating a three dimensional model of mutated proteins having an altered binding 
affinity for a ligand). 

20 Computer systems are also used to screen for mutations, polymorphic 

variants, alleles and interspecies homologs of TC-ICS genes. Such mutations are 
associated with disease states or genetic traits. As described above, high density 
oligonucleotide arrays (GeneChip™) and related technology can also be used to screen for 
mutations, polymorphic variants, alleles and interspecies homologs. Once the variants are 

25 identified, diagnostic assays are used to identify patients having such mutated genes. 

Identification of the mutated TC-ICS genes involves receiving input of a first nucleic acid 
or amino acid sequence encoding selected from the group consisting of, e.g., SEQ ID 
NOS: 1 , 4 or 7, or SEQ ID NOS: 2, 5 or 8, and conservatively modified versions thereof. 
The sequence is entered into the computer system as described above and then saved to a 

30 computer readable form. The first nucleic acid or amino acid sequence is then compared 
to a second nucleic acid or amino acid sequence that has substantial identity to the first 
sequence. The second sequence is entered into the computer system in the manner 
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described above. Once the first and second sequences are compared, nucleotide or amino 
acid differences between the sequences are identified. Such sequences can represent 
allelic differences in TC-ICS genes, and mutations associated with disease states and 
genetic traits. 

III. Isolation of the nucleic acid encoding TC-ICS 

A. General recombinant DNA methods 

This invention relies on routine techniques in the field of recombinant 
genetics. Basic texts disclosing the general methods of use in this invention include 
Sambrook et al. 9 Molecular Cloning, A Laboratoiy Manual (2nd ed. 1989); Kriegler, 
Gene Transfer and Expression: A Laboratory Manual (1990); and Current Protocols in 
Molecular Biology (Ausubel etal, eds., (1994)). 

For nucleic acids, sizes are given in either kilobases (kb) or base pairs 
(bp). These are estimates derived from agarose or acrylamide gel electrophoresis, from 
sequenced nucleic acids, or from published DNA sequences. For proteins, sizes are given 
in kilodaltons (kDa) or amino acid residue numbers. Proteins sizes are estimated from gel 
electrophoresis, from sequenced proteins, from derived amino acid sequences, or from 
published protein sequences. 

Oligonucleotides that are not commercially available can be chemically 
synthesized according to the solid phase phosphoramidite triester method first described 
by Beaucage & Caruthers, Tetrahedron Letts. 22:1859-1862 (1981), using an automated 
synthesizer, as described in Van Devanter et al 9 Nucleic Acids Res. 12:6159-6168 
(1984). Purification of oligonucleotides is by either native acrylamide gel electrophoresis 
or by anion-exchange HPLC as described in Pearson & Reamer, J. Chrom. 255:137-149 
(1983). 

The sequence of the cloned genes and synthetic oligonucleotides can be 
verified after cloning using, e.g., the chain termination method for sequencing double- 
stranded templates of Wallace et al, Gene 16:21-26 (1981). 

B. Cloning methods for the isolation of nucleotide sequences encoding TC- 
ICS 

In general, the nucleic acid sequences encoding TC-ICS and related 
nucleic acid sequence homologs are cloned from cDNA and genomic DNA libraries by 
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hybridization with a probe, or isolated using amplification techniques with 
oligonucleotide primers. For example, TC-ICS sequences are typically" isolated from 
mammalian nucleic acid (genomic or cDNA) libraries by hybridizing with a nucleic acid 
probe, the sequence of which can be derived from SEQ ID NOS: 1, 3, 4, 6, or 7. A 
5 suitable tissue from which TC-ICS and cDNA can be isolated is tongue tissue, preferably 
taste bud tissue, more preferably individual taste cells. For example, circumvallate, 
foliate, fungiform taste receptor cells are used to isolate RNA and cDNA. 

Amplification techniques using primers are also used to amplify and 
isolate TC-ICS from DNA or RNA {see, e.g., Dieffenfach & Dveksler, PCR Primer: A 
10 Laboratory Manual (1995)). These primers are used, eg., to amplify either the full 

length sequence or a probe of one to several hundred nucleotides, which is then used to 
screen a mammalian library for full-length TC-ICS. 

Nucleic acids encoding TC-ICS can also be isolated from expression 
libraries using antibodies as probes. Such polyclonal or monoclonal antibodies can be 
1 5 raised using the sequence of SEQ ID NO:2, SEQ ID NO:5, or SEQ ID NO:8. 

TC-ICS polymorphic variants, alleles, and interspecies homologs that are 
substantially identical to TC-ICS are isolated using TC-ICS nucleic acid probes and 
oligonucleotides under stringent hybridization conditions, by screening libraries. 
Alternatively, expression libraries are used to clone TC-ICS and its polymorphic variants, 
20 alleles, and interspecies homologs, by detecting expressed homologs immunologically 
with antisera or purified antibodies made against TC-ICS which also recognize and 
selectively bind to the TC-ICS homolog. 

To make a cDNA library, one should choose a source that is rich in TC- 
ICS mRNA, e.g., tongue tissue, or isolated taste buds. The mRNA is then made into 
25 cDNA using reverse transcriptase, ligated into a recombinant vector, and transfected into 
a recombinant host for propagation, screening and cloning. Methods for making and 
screening cDNA libraries are well known (see, e.g., Gubler & Hoffman, Gene 25:263-269 
(1983); Sambrook et al, supra; Ausubel et al, supra). 

For a genomic library, the DNA is extracted from the tissue and either 
30 mechanically sheared or enzymatically digested to yield fragments of about 12-20 kb. 

The fragments are then separated by gradient centrifugation from undesired sizes and are 
constructed in bacteriophage lambda vectors. These vectors and phage are packaged in 
vitro. Recombinant phage are analyzed by plaque hybridization as described in Benton & 
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Davis, Science 196:180-182 (1977). Colony hybridization is carried out as generally 
described in Grunstein et aL, Proc. Natl Acad, Set USA., 72:3961-3965 (1975). 

An alternative method of isolating TC-ICS nucleic acid and its homologs 
combines the use of synthetic oligonucleotide primers and amplification of an RNA or 
DNA template (see U.S. Patents 4,683,195 and 4,683,202; PCR Protocols: A Guide to 
Methods and Applications (Innis et aL, eds, 1990)). Methods such as polymerase chain 
reaction (PCR) and ligase chain reaction (LCR) are used to amplify nucleic acid 
sequences of TC-ICS directly from mRNA, from cDNA, from genomic libraries or cDNA 
libraries. Degenerate oligonucleotides can be designed to amplify TC-ICS homologs 
using the sequences provided herein. Restriction endonuclease sites can be incorporated 
into the primers. Polymerase chain reaction or other in vitro amplification methods may 
also be useful, for example, to clone nucleic acid sequences that code for proteins to be 
expressed, to make nucleic acids to use as probes for detecting the presence of TC-ICS 
encoding mRNA in physiological samples, for nucleic acid sequencing, or for other 
purposes. Genes amplified by the PCR reaction can be purified from agarose gels and 
cloned into an appropriate vector. 

Gene expression of TC-ICS can also be analyzed by techniques known in 
the art, e.g., reverse transcription and amplification of mRNA, isolation of total RNA or 
poly A + RNA, northern blotting, dot blotting, in situ hybridization, RNase protection, and 
the like. In one embodiment, high density oligonucleotide arrays technology (e.g., 
GeneChip™) is used to identify homologs and polymorphic variants of the TC-ICS of the 
invention (see t e.g., Gunthand et al t AIDS Res. Hum. Retroviruses 14:869-876 (1998); 
Kozal et aL, Nat. Med. 2:753-759 (1996); Matson et aL, Anal. Biochem. 224:101-106 
(1995); Lockhart et aL, Nat. BiotechnoL 14:1675-1680 (1996); Gingeras et aL, Genome 
Res. 8:435-448 (1998); Hacia et aL, Nucleic Acids Res. 26:3865-3866 (1998)). 

Synthetic oligonucleotides are used to construct recombinant TC-ICS 
genes for use as probes or for expression of protein. This method is performed using a 
series of overlapping oligonucleotides usually 40-120 bp in length, representing both the 
sense and nonsense strands of the gene. These DNA fragments are then annealed, ligated 
and cloned. Alternatively, amplification techniques are used with precise primers to 
amplify a specific subsequence of the TC-ICS nucleic acid. The specific subsequence is 
then ligated into an expression vector. 
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The nucleic acid encoding TC-ICS is typically cloned into intermediate 
vectors before transformation into prokaryotic or eukaryotic cells for replication and/or 
expression. These intermediate vectors are typically prokaryote vectors, e.g., plasmids, or 
shuttle vectors. 

5 

C. Expression in prokaryotes and eukaryotes 

To obtain high level expression of a cloned gene or nucleic acid, such as 
those cDNAs encoding TC-ICS, one typically subclones TC-ICS into an expression 
vector that contains a strong promoter to direct transcription, a transcription/translation 

10 terminator, and if for a nucleic acid encoding a protein, a ribosome binding site for 
translational initiation. Suitable bacterial promoters are well known in the art and 
described, e.g., in Sambrook et al. and Ausubel et al Bacterial expression systems for 
expressing the TC-ICS proteins are available in, e.g., E. coli, Bacillus sp., and Salmonella 
(Palva et al, Gene 22:229-235 (1983); Mosbach et al, Nature 302:543-545 (1983)). Kits 

15 for such expression systems are commercially available. Eukaryotic expression systems 
for mammalian cells, yeast, and insect cells are well known in the art and are also 

commercially available. 

The promoter used to direct expression of a heterologous nucleic acid 
depends on the particular application. The promoter is preferably positioned about the 
20 same distance from the heterologous transcription start site as it is from the transcription 
start site in its natural setting. As is known in the art, however, some variation in this 
distance can be accommodated without loss of promoter function. 

In addition to the promoter, the expression vector typically contains a 
transcription unit or expression cassette that contains all the additional elements required 
25 for the expression ofthe TC-ICS encoding nucleic acid in host cells. A typical expression 
cassette thus contains a promoter operably linked to the nucleic acid sequence encoding 
TC-ICS and signals required for efficient polyadenylation of the transcript, ribosome 
binding sites, and translation termination. The nucleic acid sequence encoding TC-ICS 
may typically be linked to a cleavable signal peptide sequence to promote secretion ofthe 
30 encoded protein by the transformed cell. Such signal peptides would include, among 

others, the signal peptides from tissue plasminogen activator, insulin, and neuron growth 
factor, and juvenile hormone esterase of Heliothis virescens. Additional elements ofthe 
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cassette may include enhancers and, if genomic DNA is used as the structural gene, 
introns with functional splice donor and acceptor sites. 

In addition to a promoter sequence, the expression cassette should also 
contain a transcription termination region downstream of the structural gene to provide 
5 for efficient termination. The termination region may be obtained from the same gene as 
the promoter sequence or may be obtained from different genes. 

The particular expression vector used to transport the genetic information 
into the cell is not particularly critical. Any of the conventional vectors used for 
expression in eukaryotic or prokaryotic cells may be used. Standard bacterial expression 
10 vectors include plasmids such as pBR322 based plasmids, pSKF, pET23D, and fusion 
expression systems such as GST and LacZ. Epitope tags can also be added to 
recombinant proteins to provide convenient methods of isolation, e.g., c-myc. 

Expression vectors containing regulatory elements from eukaryotic viruses 
are typically used in eukaryotic expression vectors, e.g., SV40 vectors, papilloma virus 
15 vectors, and vectors derived from Epstein-Barr virus. Other exemplary eukaryotic 

vectors include pMSG, pAV009/A + , P MTO10/A + , pMAMneo-5, baculovirus pDSVE, 
and any other vector allowing expression of proteins under the direction of the SV40 
early promoter, S V40 later promoter, metallothionein promoter, murine mammary tumor 
virus promoter, Rous sarcoma virus promoter, polyhedrin promoter, or other promoters 
20 shown effective for expression in eukaryotic cells. 

Some expression systems have markers that provide gene amplification 
such as thymidine kinase, hygromycin B phosphotransferase, and dihydrofolate reductase. 
Alternatively, high yield expression systems not involving gene amplification are also 
suitable, such as using a baculovirus vector in insect cells, with a TC-ICS encoding 
25 sequence under the direction of the polyhedrin promoter or other strong baculovirus 
promoters. 

The elements that are typically included in expression vectors also include 
a replicon that functions in E. coli, a gene encoding antibiotic resistance to permit 
selection of bacteria that harbor recombinant plasmids, and unique restriction sites in 
30 nonessential regions of the plasmid to allow insertion of eukaryotic sequences. The 
particular antibiotic resistance gene chosen is not critical, any of the many resistance 
genes known in the art are suitable. The prokaryotic sequences are preferably chosen 
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such that they do not interfere with the replication of the DNA in eukaryotic cells, if 
necessary. 

Standard transfection methods are used to produce bacterial, mammalian, 
yeast or insect cell lines that express large quantities of TC-ICS, which are then purified 
5 using standard techniques (see, e.g., Colley et al, J. Biol Chem. 264:17619-17622 

( 19S9); Guide to Protein Purification, in Methods in Enzymology, vol. 182 (Deutscher, 
ed., 1 990)). Transformation of eukaryotic and prokaryotic cells are performed according 
io standard techniques (see, e.g., Morrison, J. Bact. 132:349-351 (1977); Clark-Curtiss & 
Curtiss, Methods in Enzymology 101 :347-362 (Wu et al, eds, 1983)). 
10 Any of the well known procedures for introducing foreign nucleotide 

sequences into host cells may be used. These include the use of calcium phosphate 
transfection, polybrene, protoplast fusion, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors and any of the other well known methods for introducing 
cloned genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host 
15 celiac, e.g.^zmhrooketal., supra). It is only necessary that the particular genetic 

engineering procedure used be capable of successfully introducing at least one gene into 
the host cell capable of expressing TC-ICS. 

After the expression vector is introduced into the cells, the transfected cells 
are cultured under conditions favoring expression of TC-ICS, which is recovered from the 
20 culture using standard techniques identified below. 

IV. Purification of TC-ICS 

Either naturally occurring or recombinant TC-ICS can be purified for use 
in functional assays. Preferably, recombinant TC-ICS is purified. Naturally occurring 
25 TC-ICS is purified, e.g., from mammalian tissue such as tongue tissue, and any other 
source of a TC-ICS homolog. Recombinant TC-ICS is purified from any suitable 

expression system. 

TC-ICS may be purified to substantial purity by standard techniques, 
including selective precipitation with such substances as ammonium sulfate; column 
30 chromatography, immunopurification methods, and others (see, e.g., Scopes, Protein 
Purification: Principles and Practice (1982); U.S. Patent No. 4,673,641; Ausubel et al, 
supra; and Sambroolc et al, supra). 
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A number of procedures can be employed when recombinant TC-ICS is 
being purified. For example, proteins having established molecular adhesion properties 
can be reversible fused to TC-ICS. With the appropriate ligand, TC-ICS can be 
selectively adsorbed to a purification column and then freed from the column in a 
5 relatively pure form. The fused protein is then removed by enzymatic activity. Finally 
TC-ICS could be purified using inmiunoaffinity columns. 

A. Purification o/TC-ICS from recombinant bacteria 
Recombinant proteins are expressed by transformed bacteria in large 
10 amounts, typically after promoter induction; but expression can be constitutive. Promoter 
induction with IPTG is a one example of an inducible promoter system. Bacteria are 
grown according to standard procedures in the art. Fresh or frozen bacteria cells are used 

for isolation of protein. 

Proteins expressed in bacteria may form insoluble aggregates ("inclusion 
15 bodies"). SeveralprotocolsaresuitableforpurificationofTC-ICSinclusionbodies. For 
example, purification of inclusion bodies typically involves the extraction, separation 
and/or purification of inclusion bodies by disruption of bacterial cells, e.g., by incubation 
in a buffer of 50 mM TRIS/HCL P H 7.5, 50 mM NaCl, 5 mM MgCl 2 , 1 mM DTT, 0.1 
mM ATP, and 1 mM PMSF. The cell suspension can be lysed using 2-3 passages 
9 0 through a French Press, homogenized using a Polytron (Brinkman Instruments) or 

sonicated on ice. Alternate methods oflysing bacteria are apparent to those of skill in the 
art {see, e.g., Sambrook et al, supra; Ausubel et al, supra). 

If necessary, the inclusion bodies are solubilized, and the lysed cell 
suspension is typically centrifuged to remove unwanted insoluble matter. Proteins that 
25 formed the inclusion bodies may be renatured by dilution or dialysis with a compatible 
buffer. Suitable solvents include, but are not limited to urea (from about 4 M to about 8 
M), formamide (at least about 80%, volume/volume basis), and guanidine hydrochloride 
(from about 4 M to about 8 M). Some solvents which are capable of solubilizing 
aggregate-forminG-proteins, for example SDS (sodium dodecyl sulfate), 70% formic 
30 acid, are inappropriate for use in this procedure due to the possibility of irreversible 

denaturation of the proteins, accompanied by a lack of immunogenicity and/or activity. 
Although guanidine hydrochloride and similar agents are denaturants, this denaturation is 
not irreversible and restoration may occur upon removal (by dialysis, for example) or 
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dilution of the denaturant, allowing re-formation of immunologically and/or biologically 
active protein. Other suitable buffers are known to those skilled in the art. TC-ICS is 
separated from other bacterial proteins by standard separation techniques, e.g., with Ni- 
NTA agarose resin. 

5 Alternatively, it is possible to purify TC-ICS from bacteria periplasm. 

After lysis of the bacteria, when TC-ICS is exported into the periplasm of the bacteria, the 
periplasms fraction of the bacteria can be isolated by cold osmotic shock in addition to 
other methods known to skill in the art. To isolate recombinant proteins from the 
periplasm, the bacterial cells are centrifuged to form a pellet. The pellet is resuspended in 

10 a buffer containing 20% sucrose. To lyse the cells, the bacteria are centrifuged and the 
pellet is resuspended in ice-cold.5 mM MgS0 4 and kept in an ice bath for approximately 
10 minutes. The cell suspension is centrifuged and the supernatant decanted and saved. 
The recombinant proteins present in the supernatant can be separated from the host 
proteins by standard separation techniques well known to those of skill in the art. 

15 

B. Standard protein separation techniques for purifying TC-ICS 

Solubility fractionation 
Often as an initial step, particularly if the protein mixture is complex, an 
initial salt fractionation can separate many of the unwanted host cell proteins (or proteins 
20 derived from the cell culture media) from the recombinant protein of interest. The 
preferred salt is ammonium sulfate. Ammonium sulfate precipitates proteins by 
effectively reducing the amount of water in the protein rnixture. Proteins then precipitate 
on the basis of their solubility. The more hydrophobic a protein is, the more likely it is to 
precipitate at lower ammonium sulfate concentrations. A typical protocol includes adding 
25 saturated ammonium sulfate to a protein solution so that the resultant ammonium sulfate 
concentration is between 20-30%. This concentration will precipitate the most 
hydrophobic of proteins. The precipitate is then discarded (unless the protein of interest 
is hydrophobic)' and ammonium sulfate is added to the supernatant to a concentration 
known to precipitate the protein of interest. The precipitate is then solubilized in buffer 
30 and the excess salt removed if necessary, either through dialysis or diafiltration. Other 
methods that rely on solubility of proteins, such as cold ethanol precipitation, are well 
known to those of skill in the art and can be used to fractionate complex protein mixtures. 
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Si7.e differp ritial filtration 
The molecular weight of TC-ICS are used to isolate them from proteins of 
greater and lesser size using ultrafiltration through membranes of different pore size (for 
example, Amicon or Millipore membranes). As a first step, the protein mixture is 
5 ultrafiltered through a membrane with a pore size that has a lower molecular weight cut- 
off than the molecular weight of the protein of interest. The retentate of the ultrafiltration 
is then ultrafiltered against a membrane with a molecular cut off greater than the 
molecular weight of the protein of interest. The recombinant protein will pass through 
the membrane into the filtrate. The filtrate can then be chromatographed as described 
10 below. 

Column r.liromatographv 
TC-ICS can also be separated from other proteins on the basis of its size, 
net surface charge, hydrophobic^, and affinity for ligands. In addition, antibodies raised 
15 against proteins can be conjugated to column matrices and the proteins immunopurified. 
All of these methods are well known in the art. It will be apparent to one of skill that 
chromatographic techniques are performed at any scale and using equipment from many 
different manufacturers (e.g., Pharmacia Biotech). 

20 V. Immunological detection of TC-ICS 

In addition to the detection of TC-ICS genes and gene expression using 
nucleic acid hybridization technology, one can also use immunoassays to detect TC-ICS, 
e.g., to identify taste receptor cells and variants of TC-ICS. Immunoassays can be used to 
qualitatively or quantitatively analyze TC-ICS. A general overview of the applicable 
25 technology can be found in Harlow & Lane, Antibodies: A Laboratory Manual (1988). 

A. Antibodies to TC-ICS 

Methods of producing polyclonal and monoclonal antibodies that react 
specifically with TC-ICS are known to those of skill in the art (see, e.g., Coligan, Cwrent 
30 Protocols in Immunology (1991); Harlow & Lane, supra; Coding, Monoclonal 

Antibodies: Principles and Practice (2d ed. 1986); and Kohler & Milstein, Nature 
256:495-497 (1975)- Such techniques include antibody preparation by selection of 
antibodies from libraries of recombinant antibodies in phage or similar vectors, as well as 
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preparation of polyclonal and monoclonal antibodies by immunizing rabbits or mice {see, 
e.g., Huse et ah, Science 246:1275-1281 (1989); Ward et al, Nature 341:544-546 
(1989)). 

A number of TC-ICS-comprising immunogens may be used to produce 
5 antibodies specifically reactive with TC-ICS. For example, recombinant TC-ICS or an 
antigenic fragment thereof, is isolated as described herein. Recombinant protein are 
expressed in eukaryotic or prokaryotic cells as described above, and purified as generally 
described above. Recombinant protein is the preferred immunogen for the production of 
monoclonal or polyclonal antibodies. Alternatively, a synthetic peptide derived from the 
10 sequences disclosed herein and conjugated to a carrier protein can be used as an 

immunogen. Naturally occurring TS-ICS may also be used either in pure or impure form. 
The product is then injected into an animal capable of producing antibodies. Either 
monoclonal or polyclonal antibodies are generated, for subsequent use in immunoassays 
to measure the protein. 

15 Methods of production of polyclonal antibodies are known to those of skill 

intheart. An inbred strain of mice (e.g., BALB/C mice) or rabbits is irnmunized with the 
protein using a standard adjuvant, such as Freund's adjuvant, and a standard 
immunization protocol. The animal's immune response to the immunogen preparation is 
monitored by taking test bleeds and determining the titer of reactivity to TC-ICS. When 
20 appropriately high titers of antibody to the immunogen are obtained, blood is collected 
from the animal and antisera are prepared. Further fractionation of the antisera to enrich 
for antibodies reactive to the protein can be done if desired (see Harlow & Lane, supra). 

Monoclonal antibodies may be obtained by various techniques famihar to 
those skilled in the art. Briefly, spleen cells from an animal immunized with a desired 
25 antigen are immortalized, commonly by fusion with a myeloma cell (see Kohler & 

Milstein, Eur. J. Immunol. 6:511-519 (1976)). Alternative methods of immortalization 
include transformation with Epstein Barr Virus, oncogenes, or retroviruses, or other 
methods well known in the art. Colonies arising from single immortalized cells are 
screened for production of antibodies of the desired specificity and affinity for the 
30 antigen, and yield of the monoclonal antibodies produced by such cells may be enhanced 
by various techniques, including injection into the peritoneal cavity of a vertebrate host 
Alternatively, one may isolate DNA sequences which encode a monoclonal antibody or a 
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binding fragment thereof by screening a DNA library from human B cells according to 
the general protocol outlined by Huse et at, Science 246:1275-1281 (1989). 

Monoclonal antibodies and polyclonal sera are collected and titered 
against the immunogen protein in an immunoassay, for example, a solid phase 

5 immunoassay with the immunogen immobilized on a solid support. Typically, polyclonal 
antisera with a titer of 10 4 or greater are selected and tested for their cross reactivity 
against non-TC-ICS proteins or even other related proteins from other organisms, using a 
competitive binding immunoassay. Specific polyclonal antisera and monoclonal 
antibodies will usually bind with a K<, of at least about 0.1 mM, more usually at least 

10 about 1 uM, preferably at least about 0.1 uM or better, and most preferably, 0.01 ]M or 
better. 

Once TC-ICS specific antibodies are available, TC-ICS are detected by a 
variety of immunoassay methods. For a review of immunological and immunoassay 
procedures, see Basic and Clinical Immunology (Stites & Terr eds., 7th ed. 1991). 
15 Moreover, the immunoassays of the present invention are performed in any of several 
configurations, which are reviewed extensively in Enzyme Immunoassay (Maggio, ed., 
1980); and Harlow & Lane, supra. 

B. Immunological binding assays 
2 0 TC-ICS are detected and/or quantified using any of a number of well 

recognized immunological binding assays {see, e.g., U.S. Patents 4,366,241; 4,376,1 10; 
4,517,288; and 4,837,168). For a review of the general immunoassays, see also, Methods 
in Cell Biology: Antibodies in Cell Biology, volume 37 (Asai, ed. 1993); Basic and 
Clinical Immunology (Stites & Terr, eds., 7th ed. 1991). Immunological binding assays 
25 (or immunoassays) typically use an antibody that specifically binds to a protein or antigen 
of choice (in this case the TC-ICS or antigenic subsequence thereof). The antibody {e.g., 
anti-TC-ICS) may be produced by any of a number of means well known to those of skill 

in the art and as described above. 

Immunoassays also often use a labeling agent to specifically bind to and 
30 label the complex formed by the antibody and antigen. The labeling agent may itself be 
one of the moieties comprising the antibody/antigen complex. Thus, the labeling agent 
may be a labeled polypeptide of TC-ICS or a labeled anti-TC-ICS antibody. 
Alternatively, the labeling agent may be a third moiety, such a secondary antibody, that 
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specifically binds to the antibody/TC-ICS complex (a secondary antibody is typically 
specific to antibodies of the species from which the first antibody is derived). Other 
proteins capable of specifically binding immunoglobulin constant regions, such as protein 
A or protein G may also be used as the label agent. These proteins exhibit a strong non- 
5 immunogenic reactivity with immunoglobulin constant regions from a variety of species 
{see, e.g., Kronval et al, J. Immunol 111:1401-1406 (1973); Akerstrom et al.,J. 
Immunol. 135:2539-2542 (1985)). The labeling agent can be modified with a detectable 
moiety, such as biotin, to which another molecule can specifically bind, such as 
streptavidin. A variety of detectable moieties are well known to those skilled in the art. 
10 Throughout the assays, incubation and/or washing steps may be required 

after each combination of reagents. Incubation steps can vary from about 5 seconds to 
several hours, preferably from about 5 minutes to about 24 hours. However, the 
incubation time will depend upon the assay format, antigen, volume of solution, 
concentrations, and the like. Usually, the assays will be carried out at ambient 
15 temperature, although they can be conducted over a range of temperatures, such as 10°C 
to 40°C. 

Non-comr>fttitive ass ay formats 
Immunoassays for detecting TC-ICS in samples may be either competitive 
20 or noncompetitive. Noncompetitive immunoassays are assays in which the amount of 
antigen is directly measured. In one preferred "sandwich" assay, for example, the anti- 
TC-ICS antibodies can be bound directly to a solid substrate on which they are 
immobilized. These immobilized antibodies then capture TC-ICS present in the test 
sample. TC-ICS is thus immobilized is then bound by a labeling agent, such as a second 
25 TC-ICS antibody bearing a label. Alternatively, the second antibody may lack a label, 
but it may, in turn, be bound by a labeled third antibody specific to antibodies of the 
species from which the second antibody is derived. The second or third antibody is 
typically modified with a detectable moiety, such as biotin, to which another molecule 
specifically binds, e.g., streptavidin, to provide a detectable moiety. 



30 
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Com petitive assay formats 
In competitive assays, the amount of TC-ICS present in the sample is 
measured indirectly by measuring the amount of a known, added (exogenous) TC-ICS 
displaced (competed away) from an anti-TC-ICS antibody by the unknown TC-ICS 

5 present in a sample. In one competitive assay, a known amount of TC-ICS is added to a 
sample and the sample is then contacted with an antibody that specifically binds to TC- 
ICS. The amount of exogenous TC-ICS bound to the antibody is inversely proportional 
to the concentration of TC-ICS present in the sample. In a particularly preferred 
embodiment, the antibody is immobilized on a solid substrate. The amount of TC-ICS 

10 bound to the antibody may be determined either by measuring the amount of TC-ICS 
present in a TC-ICS/antibody complex, or alternatively by measuring the amount of 

remaining uncomplexed protein. The amount of TC-ICS may be detected by providing a 

labeled TC-ICS molecule. 

A hapten inhibition assay is another preferred competitive assay. In this 
15 assay the known TC-ICS is immobilized on a solid substrate. A known amount of anti- 
TC-ICS antibody is added to the sample, and the sample is then contacted with the 
immobilized TC-ICS. The amount of anti-TC-ICS antibody bound to the known 
immobilized TC-ICS is inversely proportional to the amount of TC-ICS present in the 
sample. Again, the amount of immobilized antibody may be detected by detecting either 
20 the immobilized fraction of antibody or the fraction of the antibody that remains in 
solution. Detection may be direct where the antibody is labeled or indirect by the 
subsequent addition of a labeled moiety that specifically binds to the antibody as 
described above. 

25 Cross-react ivity determinations 

Immunoassays in the competitive binding format can also be used for 
cross-reactivity determinations. For example, a protein at least partially encoded by SEQ 
ID NO:2, SEQ ID NO:5, or SEQ ID NO:8 can be immobilized to a solid support. 
Proteins (e.g., TC-ICS proteins and homologs) are added to the assay that compete for 
30 binding of the antisera to the immobilized antigen. The ability of the added proteins to 
compete for binding of the antisera to the immobilized protein is compared to the ability 
of TC-ICS encoded by SEQ ID NOS: 1, 3, 4, 6 or 7 to compete with itself. The percent 
cross-reactivity for the above proteins is calculated, using standard calculations. Those 
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antisera with less than 10% cross-reactivity with each of the added proteins listed above 
are selected and pooled. The cross-reacting antibodies are optionally removed from the 
pooled antisera by immunoabsorption with the added considered proteins, e.g., distantly 
related homologs. 

5 The immunoabsorbed and pooled antisera are then used in a competitive 

binding immunoassay as described above to compare a second protein, thought to be 
perhaps an allele or polymorphic variant of TC-ICS to the immunogen protein (i.e., TC- 
ICS of SEQ ID NO:2, SEQ ID NO:5, or SEQ ID NO:8). In order to make this 
comparison, the two proteins are each assayed at a wide range of concentrations and the 

10 amount of each protein required to inhibit 50% of the binding of the antisera to the 

immobilized protein is determined. Ifthe amount of the second protein required to inhibrf 
50% of binding is less than 10 times the amount of the protein encoded by SEQ ID NO:2, 
SEQ ID NO:5, or SEQ ID NO:8 that is required to inhibit 50% of binding, then the 
second protein is said to specifically bind to the polyclonal antibodies generated to a TC- 

15 ICS immunogen. 

Other assav formats 
Western blot (immunoblot) analysis is used to detect and quantify the 
presence of TC-ICS in the sample. The technique generally comprises separating sample 
20 proteins by gel electrophoresis on the basis of molecular weight, transferring the 

separated proteins to a suitable solid support, (such as a nitrocellulose filter, a nylon filter, 
or derivatized nylon filter), and incubating the sample with the antibodies that specially 
bind TC-ICS. The anti-TC-ICS antibodies specifically bind to the TC-ICS on the solid 
support. These antibodies may be directly labeled or alternatively may be subsequently 
25 detected using labeled antibodies (e.g., labeled sheep anti-mouse antibodies) that 
specifically bind to the anti-TC-ICS antibodies. 

Other assay formats include liposome immunoassays (LIA), which use 
liposomes designed to bind specific molecules (e.g., antibodies) and release encapsulated 
reagents or markers. The released chemicals are then detected according to standard 
30 techniques (see Monroe et al, Amer. Clin. Prod. Rev. 5 :34-41 (1986)). 



WO 02/054069 PCT/US01/49808 

47 

Reduction of non-specific binding 
One of skill in the art will appreciate that it is often desirable to minimize 
non-specific binding in immunoassays. Particularly, where the assay involves an antigen 
or antibody immobilized on a solid substrate it is desirable to minimize the amount of 
non-specific binding to the substrate. Means of reducing such non-specific binding are 
well known to those of skill in the art. Typically, this technique involves coating the 
substrate with a proteinaceous composition. In particular, protein compositions such as 
bovine serum albumin (BSA), nonfat powdered milk, and gelatin are widely used with 
powdered milk being most preferred. 
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Labels 

The particular label or detectable group used in the assay is not a critical 
aspect of the invention, as long as it does not significantly interfere with the specific 
binding of the antibody used in the assay. The detectable group can be any material 

15 having a detectable physical or chemical property. Such detectable labels have been well- 
developed in the field of immunoassays and, in general, most any label useful in such 
methods can be applied to the present invention. Thus, a label is any composition 
detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, 
optical or chemical means. Useful labels in the present invention include magnetic beads 

20 (e.g. , DYNABEADS™), fluorescent dyes {e.g., fluorescein isothiocyanate, Texas red, 
rhodamine, and the like), radiolabels (e.g., 3 H, 125 I, 35 S, 14 C, or 32 P), enzymes (e.g., horse 
radish peroxidase, alkaline phosphatase and others commonly used in an ELISA), and 
colorimetric labels such as colloidal gold or colored glass or plastic beads (e.g., 
polystyrene, polypropylene, latex, etc.). 

25 The label may be coupled directly or indirectly to the desired component 

of the assay according to methods well known in the art. As indicated above, a wide 
variety of labels may be used, with the choice of label depending on sensitivity required, 
ease of conjugation with the compound, stability requirements, available instrumentation, 

and disposal provisions. 
30 Non-radioactive labels are often attached by indirect means. Generally, a 

ligand molecule (e.g., biotin) is covalently bound to the molecule. The ligand then binds 
to another molecules (e.g., streptavidin) molecule, which is either inherently detectable or 
covalently bound to a signal system, such as a detectable enzyme, a fluorescent 
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compound, or a chemiluminescent compound. The ligands and their targets can be used 
in any suitable combination with antibodies that recognize TC-ICS, or secondary 
antibodies that recognize anti-TC-ICS. 

The molecules can also be conjugated directly to signal generating 

5 compounds, e.g., by conjugation with an enzyme or fluorophore. Enzymes of interest as 
labels will primarily be hydrolases, particularly phosphatases, esterases and glycosidases, 
or oxidotases, particularly peroxidases. Fluorescent compounds include fluorescein and 
its derivatives, rhodamine and its derivatives, dansyl, umbelliferone, etc. 
Chemiluminescent compounds include luciferin, and 2,3-dihydrophthalazinediones, e.g., 

10 luminol. For a review of various labeling or signal producing systems that may be used, 

see U.S. Patent No. 4,391,904. 

Means of detecting labels are well known to those of skill in the art. Thus, 
for example, where the label is a radioactive label, means for detection include a 
scintillation counter or photographic film as in autoradiography. Where the label is a 

15 fluorescent label, it may be detected by exciting the fluorochrome with the appropriate 
wavelength of light and detecting the resulting fluorescence. The fluorescence may be 
detected visually, by means of photographic film, by the use of electronic detectors such 
as charge coupled devices (CCDs) or photomultipliers and the like. Similarly, enzymatic 
labels may be detected by providing the appropriate substrates for the enzyme and 

20 detecting the resulting reaction product. Finally simple colorimetric labels may be 
detected simply by observing the color associated with the label. Thus, in various 
dipstick assays, conjugated gold often appears pink, while various conjugated beads 

appear the color of the bead. 

Some assay formats do not require the use of labeled components. For 
25 instance, agglutination assays can be used to detect the presence of the target antibodies. 
In this case, antigen-coated particles are agglutinated by samples comprising the target 
antibodies. In this format, none of the components need be labeled and the presence of 
the target antibody is detected by simple visual inspection. 

30 VI. Kits 

TC-ICS and its homologs are a useful tool for identifying taste receptor 
cells, for forensics and paternity determinations, and for examining taste transduction 
(e.g., generating a topographical map between the taste cells of the tongue and the 
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corresponding taste centers in the brain). Specific reagents that specifically hybridize to 
TC-ICS nucleic acid, such as its probes and primers, and specific reagents that 
specifically bind to the TC-ICS protein, e.g., their antibodies are used to examine taste 
cell expression and taste transduction regulation. 
5 Nucleic acid assays for the presence of TC-ICS DNA and RNA in a 

sample include numerous techniques are known to those skilled in the art, such as 
Southern analysis, northern analysis, dot blots, RNase protection, high density 
oligonucleotide arrays, SI analysis, amplification techniques such as PCR and LCR, and 
in situ hybridization. In in situ hybridization, for example, the target nucleic acid is 
10 liberated from its cellular surroundings in such as to be available for hybridization within 
the cell while preserving the cellular morphology for subsequent interpretation and 
analysis (see Example I). The following articles provide an overview of the art of in situ 
hybridization: Singer et al, Biotechniques 4:230-250 (1986); Haase et al, Methods in 
Virology, vol. VII, pp. 189-226 (1984); and Nucleic Acid Hybridization: A Practical 
15 Approach (Hames et al, eds. 1987). In addition, TC-ICS protein can be detected with the 
various immunoassay techniques described above. The test sample is typically compared 
to both a positive control {e.g., a sample expressing recombinant TC-ICS) and a negative 
control. 

The present invention also provides for kits for screening for modulators 
20 of TC-ICS. Such kits can be prepared from readily available materials and reagents. For 
example, such kits can comprise any one or more of the following materials: TC-ICS, 
reaction tubes, and instructions for testing TC-ICS activity. Preferably, the kit contains 
biologically active TC-ICS. A wide variety of kits and components can be prepared 
according to the present invention, depending upon the intended user of the kit and the 
25 particular needs of the user. 

VII. Administration and pharmaceutical compositions 

Taste modulators can be administered directly to the mammalian subject 
for modulation of taste in vivo. Administration is by any of the routes normally used for 
30 introducing a modulator compound into ultimate contact with the tissue to be treated, 
preferably the tongue or mouth. The taste modulators are administered in any suitable 
maimer, preferably with pharmaceutically acceptable carriers. Suitable methods of 
administering such modulators are available and well known to those of skill in the art, 
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and, although more than one route can be used to administer a particular composition, a 
particular route can often provide a more immediate and more effective reaction than 
another route. 

Phannaceutically acceptable carriers are determined in part by the 
5 particular composition being administered, as well as by the particular method used to 
administer the composition. Accordingly, there is a wide variety of suitable formulations 
v of pharmaceutical compositions of the present invention (see, e.g., Remington *s 
Pharmaceutical Sciences, 17 th ed. 1985)). 

The taste modulators, alone or in combination with other suitable 
10 components, can be made into aerosol formulations {i.e., they can be "nebulized") to be 
administered via inhalation. Aerosol formulations can be placed into pressurized 
acceptable propellants, such as dichlorodifluoromethane, propane, nitrogen, and the like. 

Formulations suitable for administration include aqueous and non-aqueous 
solutions, isotonic sterile solutions, which can contain antioxidants, buffers, bacteriostats, 
15 and solutes that render the formulation isotonic, and aqueous and non-aqueous sterile 
suspensions that can include suspending agents, solubilizers, thickening agents, 
stabilizers, and preservatives. In the practice of this invention, compositions can be 
administered, for example, by orally, topically, intravenously, intraperitoneally, 
intravesical^ or intrathecally. Preferably, the compositions are administered orally or 
20 nasally. The formulations of compounds can be presented in unit-dose or multi-dose 

sealed containers, such as ampules and vials. Solutions and suspensions can be prepared 
from sterile powders, granules, and tablets of the kind previously described. The 
modulators can also be administered as part a of prepared food or drug. 

The dose administered to a patient, in the context of the present invention 
25 should be sufficient to effect a beneficial response in the subject over time. The dose will 
be determined by the efficacy of the particular taste modulators employed and the 
condition of the subject, as well as the body weight or surface area of the area to be 
treated. The size of the dose also will be determined by the existence, nature, and extent 
of any adverse side-effects that accompany the administration of a particular compound 
30 or vector in a particular subject. 

In determining the effective amount of the modulator to be administered in 
a physician may evaluate circulating plasma levels of the modulator, modulator toxicities, 
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and the production of anti-modulator antibodies. In general, the dose equivalent of a 
modulator is from about 1 ng/kg to 10 mg/kg for a typical subject. 

For administration, taste modulators of the present invention can be 
adrninistered at a rate determined by the LD-50 of the modulator, and the side-effects of 
5 the inhibitor at various concentrations, as applied to the mass and overall health of the 
subject. Administration can be accomplished via single or divided doses. 

All publications and patent applications cited in this specification are 
herein incorporated by reference as if each individual publication or patent application 
10 were specifically and individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in some detail by 
way of illustration and example for purposes of clarity of understanding, it will be readily 
apparent to one of ordinary skill in the art in light of the teachings of this invention that 
certain changes and modifications may be made thereto without departing from the spirit 
15 or scope of the appended claims. 

EXAMPLES 

The following examples are provided by way of illustration only and not 
20 by way of limitation. Those of skill in the art will readily recognize a variety of 

noncritical parameters that could be changed or modified to yield essentially similar 
results. 



25 



F.xam ple I: Taste cell sp e r.ifir. expression of TC-ICS and cloning 



Taste bud isolation: 

Subtraction libraries made from rat circumvallate cells were used to isolate 
the TC-ICS nucleic acids of the invention. Briefly, single taste receptor cells were 
isolated from dissociated circumvallate papillae from the rat tongue as generally 
30 described by Bernhardt et al, J. Physiol. (Lond), 490, 325-336 (1996). Amplified single 
cell cDNA was Southern and dot-blotted and probed with radiolabeled probes to identify 
potentially similar cell types. Gustducin, a G-protein specifically expressed in a subset of 
taste receptor cells was chosen as a marker for taste cells (McLaughlin et al, Nature 
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357:563-569 (1992)). Tubulin and N-Cam were chosen to confirm the integrity of the 
cells and validate the amplification reactions. Bacteriophage lambda cDNA libraries 
were then constructed from individual Gustducin positive cells and were plated at low 
density on LB/Agar plates. 

5 

Subtraction and library generation'. 

Total RNA was extracted from circumvallate papillae using standard 
proceduresCTrizol method; GibcoBRL); approximately 2 ug of Poly A+ RNA were 
purified using Qiagen's Oligotex mRNA Kit and used to prime cDNA synthesis 
10 following standard protocols. Samples were stored in media containing RNAse inhibitors 
to prevent degradation of mRNA. 

Suppression subtraction was performed according to Clontech's PCR- 
Select cDNA Subtraction protocol using circumvallate cDNA as tester and non-taste 
cDNA prepared from non-taste lingual tissue as driver. The efficiency of the subtraction 
15 procedure was monitored by probing the subtracted product with two known taste- 
specific genes (Lunch and Repeater); both of these genes were enriched greater than 50 - 
fold in the subtracted cDNA compared to the unsubtracted circumvallate cDNA. 

Sequence analysis and in situ hybridization: 

20 Subtracted cDNAs were cloned into pBluescript-based plasmid vectors to 

generate a subtracted cDNA library. 930 clones from the subtracted library were then 
chosen for sequence analysis. DNA sequences were mined using Blast searches against 
nucleotide and protein databases (http://www.ncbi.nlm.nih.gov/BLAST/). All sequences 
were also analyzed for the presence of potential transmembrane segments 

25 (http://dot.imgen.bcm.^ cDNAs encoding 

novel sequences were used in in situ hybridizations to tongue tissue sections to examine 
taste cell expression. 



30 



Results 

Clone 501-PCR46 showed selective expression in subsets of taste receptor 
cells of circumvallate and fungiform papillae. This clone was chosen for detailed 
characterization, including full-length cDNA isolation and sequencing. The results show 
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that Clone 501-PCR46 represents a mRNA specifically expressed in circumvallate, foliate 
and fungiform taste receptor cells . This is a relatively rare cDNA found in 
approximately 1/100,000 cDNAs from an oligo-dT primed circumvallate cDNA library 
(Hoon et al), Cell 96: 541-51(1999) and in approximately 1 /300 cDNAs in our 
, subtracted library. The cDNA sequence from clone 501-PCR46 is shown in SEQ ID 
NO-1 andthepremctedaminoacidisinSEQIDNO:2. The corresponding in situ 
hybridization to a tissue section demonstrating taste cell specificity is shown in Figure 1. 

.Glene-SO-l-P^e-encodes^ajiQy^ member of the TRP family of ion 

channels (Harteneck et al., "From worm to man: three subfamilies of TRP channels," 
10 Trends NcuroscL 23(4): 159-66(2000)). A related gene known as Mtrl was previously 
reported to be associated with chromosomal imprinting (Enklaar et al. "Mtrl, a novel 
biallelicallv expressed gene in the center of the mouse distal chromosome 7 imprinting 
cluster, is a member of the Trp gene family," Genomics. 67(2):179-87; Esswein et al, 
,000)) and a chromosomal interval implicated in the Beckwith-Wiedemann syndrome 
15 region on human llpl5.5 (Yatsuki etal. "Sequence-based structural features between 
Kvlqtl and Tapal on mouse chromosome 7F4/F5 corresponding to the Beckwith- 
Wiedemann syndrome region on human 11 P 15.5, long-stretches of unusually well 
conser^d intronic sequences of kvlqtl between mouse and human." DNA Res. 7(3):195- 
->06 (2000); Paulsen et al., "Sequence conservation and variability of imprinting in the 
20 Beckwith-Wiedemann syndrome gene cluster in human and mouse," Hum Mol Genet. 
9(12)18^9-41 (2000)). However, no function, cell-type specific expression, or role had 
been assigned to this "orphan" ion channel. Clone 501-PCR46 appears to be a splice 
variant of Mtrl (aka LTRPC5; Paulsen et al, 2000), and is a component of the taste 
signaling machinery. 

25 p^ pteTTr Functi — i a™1*™ of TC-TCS proteins expre-ed in a heterologous cell 

TC-ICS encoding nucleic acids (e.g., SEQ ID NO:l. SEQ ID NO:4 or SEQ 
ID NO:7) are expressed in a heterologous cell, alone or with other cell transduction 
proteins such as a G-protein a subunit and/ or a taste cell specific G-protein coupled 

30 receptor such as GPCR-B3, GPCR-B4 (see U.S.S.N. 60/094,465 filed July 28, 1998 for 
the description of GPCR-B3 and U.S.S.N. 60/095,464 filed July 28, 1998 and 60/112,747 
filed December 17, 1998 for the description of GPCR-B4), Gal4 or Gal5 (Wilkie et al. 
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PNAS USA 88:10049-10053 (1991)). These transformed cells are used to screen for 
activators, inhibitors, and modulators of TC-ICS, including mdulatorts of its intewraction 
with GPCRs or G-proteins. Different assays for ion channel mediated functions are 
performed as generally described above and in PCT 99/06307, incorporated by reference 
5 herein. 

In particular, modulation of taste transduction is assayed by measuring 
changes in intracellular Ca 2+ levels, which change in response to modulation of the TC- 
ICS signal transduction pathway via administration of a molecule that associates with TC- 
ICS. Changes in Ca 2+ levels are preferably measured using fluorescent Ca 2+ indicator 
10 dyes and fluorometric imaging. The amount of [Ca 2+ ]i is then compared to the amount of 
[Ca 2+ ]i in either the same cell in the absence of the test compound, or it may be compared 
to the amount of [Ca 2+ ]j in a substantially identical cell that lacks TC-ICS. 
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WHAT IS CLAIMED IS: 

1 1 . A method for identifying a compound that modulates taste 

2 signaling in taste cells, the method comprising the steps of: 

3 (i) contacting the compound with a eukaryotic host cell or cell 

4 membrane which expresses a taste cell-specific ion channel subunit: 

5 (a) having greater than about 70% amino acid sequence 

6 identity to a polypeptide having a sequence selected from the group that consists of SEQ 

7 ID NO: 2, SEQ ID NO: 5, and SEQ ID NO: 8; and 

8 (b) specifically binding to polyclonal antibodies that 

9 specifically bind to a polypeptide having a sequence selected from the group that consists 

10 of SEQ ID NO: 2, SEQ ID NO: 5, and SEQ ID NO: 8; and 

1 1 (ii) determining a functional effect of the compound upon a 

12 transmembrane ion flux of a predetermined ion, thereby identifying a compound that 

1 3 modulates taste signaling in taste cells. 

1 2. The method of claiml , wherein the functional effect is determined 

2 by measuring changes in intracellular ion concentration. 

1 3 . The method of claim 1 , wherein the functional effect is determined 

2 by measuring changes in intracellular Ca**. 

1 4. The method of claim 1 , wherein the changes in ion flux are 

2 measured by an assay selected from the group consisting of a voltage clamp assay, a 

3 patch clamp assay, a radiolabeled ion flux assay, or a fluorescence assay using ion 

4 sensitive dyes. 

1 5 m The method of claim 1 , wherein the cell or cell membrane is 

2 attached to a solid substrate. 

1 6. The method of claim 1 , wherein the taste cell-specific ion channel 

2 subunit is from a mammal. 
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1 7. The method of claim 6, wherein the taste cell-specific ion channel 

2 subunit has an amino acid sequence selected from the group that consists of SEQ ID NO: 

3 2, SEQ ID NO: 5, and SEQ ID NO: 8. 

1 8. The method of claim 1 , wherein the host cell is a human cell. 

1 9. The method of claim 1, wherein the host cell is a HEK 293 cell. 

1 10. A method for identifying a compound that binds to a taste cell- 

2 specific ion channel subunit, said method comprising: 

3 (i) contacting control cells that do not express a taste cell-specific ion 

4 channel subunit with a test compound; 

5 (ii) contacting test cells with said test compound, wherein said test cells 

6 are transformed with and express a nucleic acid encoding a taste cell-specific ion channel 

7 subunit 

S (a) having greater than about 70% amino acid sequence identity to 

9 to a polypeptide having a sequence selected from the group that consists of SEQ ID NO: 

10 2, SEQ ID NO: 5, and SEQ ID NO: 8; and 

H (b) specifically binding to polyclonal antibodies that specifically 

12 bind to a polypeptide having a sequence selected from the group that consists of SEQ ID 

13 NO: 2, SEQ ID NO: 5, and SEQ ID NO: 8; and; 

14 (iii) identifying test compounds that bind to said taste cell-specific ion 

15 channel subunit by comparing the amount of said test compound that binds to said test 

16 cells to the amount of said test compound that binds to said control cells. 

1 1 1 . A method according to claim 10 wherein said nucleic acid encodes 

2 an amino acid selected from the group that consists of SEQ ID NO: 2, SEQ ID NO: 5, and 

3 SEQ ID NO: 8. 

1 12. A method of modulating taste signaling in taste cells of an 

2 individual, comprising 

3 administering to an individual a pharmacologically effective amount of a 

4 composition that modulates taste signalling by an ion channel subunit 

5 (a) having greater than about 70% amino acid sequence identity to 

6 to a polypeptide having a sequence selected from the group that consists of SEQ ID NO: 

7 2, SEQ ID NO: 5, and SEQ ID NO: 8; and 
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8 . (b) specifically binding to polyclonal antibodies that specifically 

9 bind to a polypeptide having a sequence selected from the group that consists of SEQ ID 

10 NO: 2, SEQ ID NO: 5, and SEQ ID NO: 8; 

1 1 thereby modulating taste signalling in taste cells of said individual. 

1 13. A method of claim 12, wherein said ion channel subunit has an 

2 amino acid sequence selected from the group that consists of SEQ ID NO: 2, SEQ ID 

3 NO: 5, and SEQ ID NO: 8. 

1 1 4. A method of claim 12, wherein said individual is a mammal. 

1 1 5 . A method of claim 1 2, wherein said individual is a human. 
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TGCCGTGGCTTCTACCAGGATGGGCGCAGGATGGAGAAGAGAGGGCCACC 
CAAGCGGCCTGCAGGCCAGAAATGGCTGCCGGACCTCAGTCGGAAGAGTG 
AAGACCCATGGAGGGACCTGTTCCTTTGGGCTGTGCTGCAGAACCGTTAT 
GAGATGGCCACATACTTCTGGGCCATGGGCCGGGAGGGTGTGGCTGCTGC 

5 TCTGGCGGCCTGCAAGATCATCAAGGAAATGTCCCACCTGGAGAAAGAGG 
CAGAGGTGGCCCGCACTATGCGTGAGGCCAAGTATGAGCAGCTGGCCCTC 
GATCTTTTCTCAGAGTGCTACAGCAACAGTGAGGACCGTGCCTTTGCCCT 
GTTGGTGCGCAGGAACCACAGCTGGAGCAGGACCAQCTGCCTGCACCTGG 
CCACTGAGGCCGATGCCAAGGCCTTCTTTGCCCATGATGGTGTGCAAGCA 

1 0 TTCCTGACGAAGATCTGGTGGGGAGACATGGCCACAGGCACACCCATCTT 
ACGACTTCTGGGTGCCTTCACCTGCCCAGCCCTCATCTACACAAATCTCA 
TCTCCTTCAGTGAGGATGCCCCGCAGAGGATGGACCTGGAAGATCTGCAG 
GAGCCAGACAGTTTGGATATGGAAAAGAGCTTCCTGTGCAGCCATGGTGG 
CCAATTGGAGAAGTTAACAGAGGCGCCAAGGGCTCCTGGCGATCTAGGCC 

1 5 CACAAGCTGCCTTCCTGCTCACACGGTGGAGGAAGTTCTGGGGCGCTCCT 
GTGACTGTGTTCTTGGGGAATGTGGTCATGTACTTTGCATTCCTCTTCCT 
ATTCTCCTACGTCCTGCTGGTGGATTTCAGGCCACCACCCCAGGGGCCAT 
CTGGGTCGGAAGTTACCCTGTATTTCTGGGTCTTCACACTGGTGCTGGAG 
GAAATCCGACAGGGATTCTTCACAAACGAGGACACCCGTCTGGTGAAGAA 

20 GTTCACTCTGTACGTAGAAGACAACTGGAACAAATGTGACATGGTGGCCA 
TCITCCTGTTCATTGTTGGTGTCACCTGTAGAATGGTGCCCTCCGTGTTT 
GAGGCTGGCCGGACTGTTCTGGCCATTGACTTCATGGTGTTCACACTTCG 
GCTCATCCACATCTTTGCTATTCACAAGCAGCTGGGTCCTAAGATCATCA 
TTGTAGAGCGGATGATGAAAGATGTCTTCTTCTTCCTCTTCTTCCTGAGC 

25 " GTGTGGCTCGTGGCCTATGGCGTGACCACTCAGGCCCTGCTGGACCCCCA 
CGATGGCCGTCTGGAGTGGATTTTCCGCCGTGTGCTCTACAGGCCTTACC 
TGCAGATCTTTGGGCAAATCCCTCTGGATGAAATTGATGAGGCCCGTGTG 
AACTGCTCTCTTCACCCGTTGCTGCTGGACAGCTCAGCTTCCTGGCCTAA 
TCTCTATGC .» AACTGGCTGGTC ATTCTGCTGCTGGTTACCTTCCTCCTCG 

30 TCACTAATGTGCTACTTATGAACCTTCTGATCGCCATGTTCAGCTACACA 
TTCCAGGTGGTGCAGGGCAATGCAGACATGTTCTGGAAGTTTCAACGCTA 
CCACCTCATGGTTGAATACCACGGAAGGCCGGCTCTGGCCCCGC.CCTTCA 
TCCTGCTCAGCCACCTGAGCCTGGTGCTCAAGCAGGTCTTCAGGAAGGAA 
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GCCCAGCACAAACAGCAACACCTGGAGAGAGACTTGCCTGACCCCGTGGA 
CCAGAAGATCATTACCTGGGAAACAGTTCAAAAGGAGAACTTCCTGAGTA 
CCATGGAGAAACGGAGGAGGGACAGTGAGAAGGAGGTGCTGAGGAAAACG 
GCACACAGAGTGGACTTGATTGCCAAATACATCGGGGGTCTGAGAGAGCA 

5 AGAAAAGAGGATCAAGTGTCTGGAGTCACAGGCCAACTACTGTATGCTCC 
TCTTGTCCTCCATGACTGACACACTGGCTCCTGGAGGCACCTACTCAAGT 
TCTCAAAACTGTGGTCGCAGGAGTCAGCCAGCCTCTGCTAGAGACAGGGA 
GTACCTAGAGGCTGGCTTGCCACACTCAGACACCTGAAATGGAGAAACCA 
CTTGCCCTAGAGCTCCAGACCTGGCCAGATTGAGGTTTTGGGTCACATCA 

10 ACCTTCCCCTGCCCCCAGCAGCCCCGAGACCTTGCCGCAGACCATGTCTT 
GGACACCTCTTCCTATGAAAATGAGACTCATGTCTTTGGCATCTATCTGG 
GAGCCCCAGGCGTCCTCTCCAGCAGGGGAAGTTTTCTCATGTCCTACCTA 
AAACTTTCACCAGCTAAGACTGGACAGCTGGAACTGGCCAAGTCCCACAT 
GGGATACCATCTGCCTGGATGGGGCTACTTACGTCTAGCCTGTCTTACCC 

15 TGAGTTCCAAAGAGGCCAACCTCTTAAACACTAGAGGTTTCCTTCTTGTC 
CTCTGATCCATCCATCAGCCGACCAGCTTCTAGAGGGCAGGACTCAGATC 
TACTGTAATCAGCTCCCATCCTTCAGCCCCCACAGCATAATTTGTGTGAT 
TGTCCTGGCACAAACCCCAAGATACTGCTCAAGGGTACCCAATGCTATTT 
TACTTTCTATAAAGCCTGTAGACCACCTCAACTAAGCTAAACTGGACCAC 

20 AGGGGTGGCTAAACCAACATTTCAAACACCTGGGGAACATGGAGTTATCT 
GACCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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SEQUENCE LISTING 
SEQIDNO:l 
Rat L-TRP taste cDNA sequence 
CAAAAGCCTCTGGAGAGCTGTGTCGAGGGTGTTGGAATCCAGATGCCCGG 
AGTTCGAAAAGTCACAATGCCGATGGCCCAGAGCTCTTGTCCTGGAAGCC 
CCCCAGATACTGGGGATGGATGGGAGCCAGTCCTATGCAAGGGAGAGGTC 
AACTTCGGAGGGTCTGGGAAAAAGCGAAGCAAGTTTGTGAAGGTGCCAAG 
CAATGTGGCCCCCTCCATGCTCTTTGAACTCCTGCTCACCGAGTGGCACC 
TGCCAGCCCCCAACCTGGTGGTGTCCCTGGTGGGCGAGGAACGGCTTTTT 
GCTATGAAGTCCTGGCTTCGGGATGTCTT.GCGCAAGGGGCTGGTGAAAGC 
AGCTCAGAGCACAGGTGCCTGGATCCTGACCAGTGCCCTCCATGTGGGCC 
TGGCACGCCATGTTGGACAGGCTGTACGTGATCACTCTCTGGCTAGCACG 
TCCACCAAGGTCCGTGTGGTGGCCATCGGAATGGCCTCTCTGGACCGAAT 
CCTTCACCGCCAACTTCTAGATGGTGTCCAGGAGGATACTCCCATCCACT 
ACCCAGCAGATGAGGGCAGCACTCAGGGACCCCTCTGCCCTCTGGACAGC 
AATCTCTCCCACTTCATCCTCGTGGAGCCAGGCACCCTTGGGAGTGGGAA 
CGACGGACTGGCAGAGCTGCAGCTGAGCCTGGAGAAGGACATCTCTCAGC 
AGAGGACAGGTTATGGGGGTACCAGCAGCATCCAGATACCTGTCCTTTGC 
TTGCTAGTCAATGGTGACCCCAGCACCCTAGAGAGGATGTCCAGGGCAGT 
GGAGCAGGCTGCCCCATGGCTGATCCTGGCAGGTTCTGGGGGCATTGCTG 
ATGTACTCGCTGCCCTGGTGGGCCAGCCTCATCTCCTGGTGCCCCAGGTG 
ACCGAGAAGCAGTTCAGAGAGAAATTCCCAAGCGAGTGTTTCTCTTGGGA 
AGCCATTGTACACTGGACAGAGCTGCTACAGAACATTGCTGCACACCCCC 
ACCTGCTCACAGTGTACGACTTTGAGCAGGAGGGTTCCGAGGACCTGGAC 
ACCGTCATCCTCAAGGCACTTGTGAAAGCCTGCAAGAGTCACAGCCGAGA 
CGCACAAGACTACCTAGATGAGCTCAAGTTAGCAGTGGCCTGGGATCGCG 
TGGACATTGCCAAGAGTGAAATCTTCAATGGGGACGTGGAGTGGAAGTCC 
TGTGACTTGGAAGAGGTGATGACAGATGCCCTAGTGAGCAACAAGCCTGA 
CTTCGTGCGCCTCTTTGTGGACAGTGGTGCTGACATGGCCGAGTTCT7 • iA 
CCTATGGGCGGCTGCAGCAGCTTTACCACTCTGTGTCCCCCAAGAGCCTC 
CTCTTTGAACTGCTGGAGCGTAAGCATGAGGAGGGTCGGCTGACACTGGC 
TGGCCTGGGTGCCCAGCAGACCCGGAAGCTGCCCGTTGGTCTGCCTGCCT 
TTTCACTCCATGAGGTCTCCCGAGTTCTCAAAGATTTCCTGCATGACGCC 
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SEQIDNO:2 
Rat L-TRP taste predicted protein sequence 
MPGVRKVTMPMAQSSCPGSPPDTGDGWEPVLCKGEVNFGGSGKKRSKFVKW 
NVAPSMLFELLLTEWHLPAPNLWSLVGEEMAMKSWLRDVLRKGLVKAAQS 

5 TGAWILTSALHVGLARHVGQAVRDHSLASTSTKVRWAIGMASLDRILHRQLLD 
GVQEDTPIHYPADEGSTQGPLCPLDSNLSHFILVEPGTLGSGNDGLAELQLSLEKH 
ISQQRTGYGGTSSIQIPVLCLLVISIGDPSTLERMSRAVEQAAPWLIIAGSGG1ADVL 
AAL V G QPHLLWQVTEKQFTIEOPSECFS WEAT^HWTELLQNLAAHPHLLTVYDF 
EQEGSEDLDTVILKALVKACKSHSRDAQDYLDELKLAVAWDRVDIAKSEIFNGD 

1 0 VEWKSCDLEEVMTDALVSNKPDFVRLFVDSGADMAEFLTYGRLQQLYHSVSPK 
SLLFELLERKHEEGRLTLAGLGAQQTRKLPVGLPAFSLHEVSRVLKDFLHDACRG 
FYQDGRRMEKRGPPKEPAGQKWLPDLSRKSEDPWRDLFLWAVLQNRYEMATY 
FWAMGP^GVAAALAACKIIKJEMSHLEKEAEVARTMREAKYEQLALDLFSECYS 
NSEDRAFAELVPJINHSWSRTTCLHLATEADAKAFFAHDGVQAFLTKIWWGDMA 

1 5 TGTPILRLLG AFTCPALIYTNLISFSEDAPQRMDLEDLQEPDSLDMEKSFLCSHGG 
QLEKLTE APRAP GDLGP Q AAFLLTl^WRKF WG AP VT WLGNVVMYF AFLFLFS YV 
LLVDFPJ^PPQGPSGSEVTLYFWWTLVLEEmQGFFTNEDTRLVKKTTLYVEDh^ 
NKCDMVAIFLFWGVTCPJvlWSWEAGRWLATOFMWTLR^ 
VERMMKDWFFLFFLSVWLVAYGVTTQALLDPHDGPXEmFRRVLYRPYLQIFG 

20 QIP LDEID E ARVNCSLHP LLLD S S AS CPNLY ANWL VILLLVTFLL VTNVLLMNLLI 
AMFS YTF Q W Q GN ADMF WKFQRYHLrVE YH GRP AL APPFILLSHLS L VLKQ VFR 
KEAQHKQQHLERDLPDPVDQKETWETVQKENFLSTMEKRRRDSEKEVLRKTAH 
RVDLIAKYIGGLREQEKRIKCLESQANYCMLLLSSMTDTLAPGGTYSSSQNCGRR 
S QP AS ARDRE YLE AGLPHSDTZ 
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SEQIDNO:3 - ! 

Mouse genomic sequence of the region that contains ltrpc5 
(gi|8574073|emb|AJ251835.1|MMU251835 Mus musculus Kcnql, Ltrpc5, Mash2, Tapa- 

1, Tssc4 and Tssc6 genes, alternative transcripts) 
GTGCCTTTGGCTCAAGCTTCCACCTCTCCATCTGGAAAAGGGCTCTCTCCTGA 
CCTCAAGTACTGAGGTCCAACTATCACTTATTGTGCTTTTAGTCAGGTGCCCC 
CTCCACCCAGGACTGCATCTTGGGCAAAGTCTCTCCTCCCTGCCTTTGGGAGA 
GTGGTTCCTGGGCTGCCCCAATTCATGGGTACTCTGATCCCTTAGATAAAATG 
GCATAGCATTTGCGGAGAACCTGCTTGTACGTCTTCATGACACCTCCCGATTC 
TTTAGAAGAGGAACTTCCACGTAATAGTCTGTAAACAGATTGAGTCAGAGGA 
TGAAGAACCTGTGGATGCTGGATGTCTGGACTATTTGCCTTAGCATACAGTCT 
TGGGAGGAAACAGCCAGCAGGACAGTGTAAGGGGCTTCCCAGGAGATGCTG 
AGCAAAAGGTATACGGTGACATGGGCCACATCCCAGGCCTAGCTCACAATCT 
AGCTTACTATCTCGATACCCATACTTTTCAGGGGAAGGAAAAGACCAGAACA 
GTTCAAACTTGAGTTATAGACAGCTGCTAGAGTCATTTACCCTGAGGTGGGCA 
CTGAAGTTTTTAGAGATATATGAGAACTACAGGGGAAGTCCCTTCCACAGCC 
ACCTATGCTGTCTCCCAGGGGTTATGCTGTAAACTCAGGAGACACTGCTGCAC 
ACACACTGTTGGTCTCAACCCTTAGTTCACACTCTACCAAGCATCCAACATCA 
TTGCCTCCTTCCTGCCAGGCACAAGACCTGGCCATGTGGCACATGAGGCTGGG 
CATGACTCTTAGTGGCCCAGATACTGAAAGGAGTGTCCAGTCTGCAACAGGA 
GACTGTACATCCACATCTGAGACCAGAAATGTGAAAACCCCAGACCCACAGA 
GCCCTGTCTCCTGGGACAATGCTTGACCGTATGTCTTCATAATGGAAACAGCA 
CCCGTACCTCCTGAGAAATCTCTCTGGTAAGGAGACOTTTACTCCTGCTGTG 
GTCCCCTTCCAGTCCAGAGGAAGGATATGAAGAGTGCCCTTGCCTATCTGCTT 
TGTAACTGGTGACCCAGAGGCACTTCTAGGAGCATGCCATATTCTCACTGCAT 
GGTCTGGAAAATATGCTTAACCTCTCTGAGTTCCCTTATCACAGAGCCTCACT 
ATGACCTTACAGAGGAAAAGCTAACTAGGGCCCAAGCATGCAAACCTTTCAC 
AGGCCTGCGGTCTTATCACTATGTCCCATGGGCTTGGCCTCACCCAAGCCATG 
*3AAGGAAAGATACAGGGATATAGGATGGTGATGCTAGATAGAAACCCAGGG 
CTGTATAAGAAATGTATCCATCAAGACATTGACCCCTACAAAATGCCCTCATG 
GTTTCACCAGCTAGCCACTGATCCCCCCAGGGCCTGCAGAGACTTCATCTTTA 
CCACCCTCACTTCTGGGCTTCCCCATCAGTGGGCAAACCTCCAAGAGGTCACC 
ACAGCCATCTTTCAAACAAAGACCTTGGGTGCTAAAAGGAGAAGCCCAGTGA 
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ATAGAACACACATGTCACCTTCCCTCTTATGGCCAGGTCAGGCCATGCTGACC 
CTCAACATTCTAAACCAGTAGTAGGTATCCCTTCAACAGACTGCATAGTGATT 
ACCACCTGAAACAGACTAGGAGAGCACGGCTTCAAAGCCGCCAGAAGCTTCT 
AAGTCTAGTCCAGTAGCAGAGACCTCAAGACATGCATCGTTTCCAAGGTAGG 

5 CAAGTGACTGTGTATGTGCTCTGCGGCCTTATGGACAGAGTGCCAGGGGAGG 
ACTAGGGACAGGTGGCATGGAAGCTCCCGACAGCCAGGTTGACCCTGCTGTA 
GGGTCAGCACTGAATGTCCTGCCCCTGCACACATGGTAGCTTGCTTACCTGAG 
CTGTCATAGCCATCAATGGAGAAATGGTCTGGCCTCCTATCCTCTGGGGGATC 
ATAAGTGATGTGAGGAACATTGAACATCTTCTCTCCAGGACTCATCCCATTAT 

1 0 CCTTATCCAGCTTGAACTTCTTCTTCTTTACCT AAAAAGTCCACATGAGGAGT 
AGCAGTTCCTGAGCACTGTCTGCCAGATCTTTAACCAACCCCAGAGCACCCGT 
GGAGTCCCGAATCCAGGCTTGAAATGTATACCCTTGAGGGAACTTGGGTAAA 
GGGGCTGAGCTCCAGTTCCTGCAGATATTGATGAAATGGGGCTTCAGTGTGTT 
AGTGTTGCTGGAGGGTTTTCGGGGGTTGCTTGGCCCAGTGTGGCCCTCTATGA 

15 GCCACAGCCTAAGAAAGGCAAGCAAACAAAAATCCAATAGACTGTAAATAT 
GCAGGCTTGCACCAGGGCAGTAGGGCTGAAGGCCAAGGCAGCAAAGAGGCA 
CTGGGGCCTCTTTAAAGAGGTCTCTCAGAGAAAGTGGGGAAGCAGGTTGTGT 
GAAACCATGGAGAGGGAAGTGCAAGTTCAAGGGTCCAGTGACAGTTGGATGG 
TGAGAAGGTTGTTTTTTCAAGGAGGGGGGAGCAAGGAAATGGAAGAGCCCCA 

20 ACCTTTTCCTCACTATTCCAGGCCAATGGCAGGACTCTTGTCCATAGCTACTG 
GAACTTGACTCTGTGGGCTAGAGTTCCAACACCTTCCTGTTCTGTCTGCTTTAC 
CCAGTAAGGAAAGCCCAGCAGCAGGGTCCTCTCAAGAGCTTCCTTACAGCAG 
CTGATGGAGCTGCCCTCTCCTATTCCAGAAGTACTCTCTTTAGGCCTATCATAT 
CCTGGTCTTGGCAGTCCAATCTGGCTCCAAGGGTGGAGGAAATGAAGTCAAC 

25 ATGGATCAGGCATGGAGTAGACATCCTCTGACTTCAGAATGATGTGGGCACC 
CTAATGAGCAGTACACGGAGTGGCAGTGCCAGAGTCTCACACACCGCCTCTC 
CAGCTTCCTGTCCTGCTAACATGTAGTTCCCTTCCTGATGGCCAGTGAGGCTG 
CTTACCATGACAGACTTTTTAGGTTTGGGGCTGGGGGACAGAAGCGTGTGACT 
CCGAGCAGGCTTCOi GACATAGATCTTCCAAGTGGCTGAGTCAGGGTTCTCA 

30 GCGGCATAGCACCTCCATGCAGTCTGAAATCAGGCAGAGGGGAAAGGAAAG 
AAGGCTCATTGGTGCAGCCACTCAGCAGTACCTCTCAAAAAATCTTCATGTCC 
TCCCCGAGCCCCAAGCCCAGCTTGTTCCAAGGCCCCTTCCCTGACATCAAGGC 
TTAGCCCTCTAGAGGGCTAGGAAGGAGAAGGCCTCATTCATTAACTATCTGTT 
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ACCTGAGCCACTCCTAAAGAGAGCCTTGAGCAAGTCCCAGACATACAGAGAT 

GCTCAACTAAGGCAGAGTTAGATAACCAGACTTGACTCAGAGGACAGTAGAA 

AAGGACTGCCTGAGCCTTTGTGAGCACCAGCAGGTTGTGGAGGGGTAGAGCC 

CCAGGGACCAGTGGGCGATGAGGCCTCCTGGGACTTCAAGGGCTAGACTGTG 

GCTACGAGGAGTCTCTGGTCACTACAGATGAAACTCTTGGCCAACATTCTAGA 

GGGAAATGCCCTCCCCATAGTCATCACATCAAGATGACCTGGGGAAGACACT 

GGAGGGGCCTAAAGGAGCCAAGGGAGCAAGAAGGGGGAAGGAACAGATAC 

CAAAGCCTCTTATGGGGACATACCATGGTGACTCTGTCATTATAGGCCCTATG 

GGCATGAGAAAGTAAAACCAGTTGAAAGTTCTCTATGCTGGCCAGGCTTCTG 

AGTGACAGCCCCATGCTATGACAGCAACCTAGCATAGACACAAAATCCAGGC 

AGGGATACTTATGACCTGGGACAGTTGGAGCCCCTTCGAAGGGTTTTATGTCC 

CATGTATATGCCCTATCTGGGCAGCTCCATACCTCATGCAGCCTGCAGCTCCT 

CCCAGCACACTCCATGTTGCCTGTGTTTCCAACTCCCACCAGTAAGATCCCTG 

GCACATCTTGCACTGTGGGTATAGCTTCTGAAGGAGTCTTGTCCACAGACCAT 

GCAGTTCCTGAGCAGTGGCCTCTGTGCTCAGCCGAGGAACTACATAACTGCCT 

GACTATGTTAATGTGGCCAAGGCAGGTTAGTGGGTCTGATTTAAGGTTTTCCA 

AGAATGAGAAGCTGTGGGGTCAAGCCAGGCCTGGGAAAGCCCACAACTGCA 

ACTGCTGGCCTCGTGGCTGTTGCTGTCAGTGGCCAGGTAAATGGCTCTGGAAG 

GGCATCAGAGGAACAAGCTGGGGCCCAGAACTGTGCTAGAAGCTTAGATAAG 

ACCATGAGAGCTACGAGCTTTGAGAAGGGCTGCCATGCCCCTCATCCAGGGT 

TCTGTCCAGATCAGAGAAGTCAGGAAACTGAGGCTGACCAGCAGAGTACAGA 

CTGGCTAAGTAGCCATGGTCAGGCCTTGCCCTAGTCTGCAAGTATGGCCCTCA 

GTGGGAACAGCATGAGGAGCAGTGGTCCTGCATCCCTTTGCCCACCCTATCAT 

CCTAATAGTGAGTACTTTCCCAGAGAGCCACCTCACCATGGGAAGAGCCGAA 

GCAAAGGGATCCATGCCTCTCAAGTAGTGGGCATGGCTGGGAGTGGTTCCAA 

CACCCAAATGGATGTAAGTACACAGCACTGATAGCCTGCCGGGCGCACCGCA 

TACCTGGATGAGTGAGGCTGCAGCTGGGATCTGCCGGTTGAAGTGCTTCTGCC 

TCTGCTTCTGCTGGACCTTCAGCGCGAACCCAGAGCCAAGTATCCCCTGGTAA 

gggacagaaggtgcccactgttacataa;ccatgctagccagtgcccagcag 
aggatgagggtcccaatgaagagcaagaagcaaaagcatctggataccagg 

TTTGCACAGTGCCCAGGGCTGTGGCCAGGCAGGCACTATCAGAAGGTGGGTT 
TGAGACAGCAGATGTCACATGCGGGCAGTCTCCAGGCTCAGCCTGCCTGGCC 
TTGCCAGCTGGGGAGGCAGAATCCATTTCTCAGAGTCGACTCAACATGAAGC 
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TTGGCTCTAGAAGAGACCTGAGACTCTACACATTTCTTGGCACCACCTGTGGG 
AATCCAGGTAATTTAGCCATCACCCCTTGCCCCTGGGGCAGAGGATTCCTAAC 
CCTAGGAGAGGGGGAGTTTGAGGCTGTCCTCAGCAAGAAGCCTACAGCTACT 
GTAGCCAGCAGTGGCAGAGATGGAATACTGACTCTGCAGCTGGGGACACACA 
5 CTGTGTGTCATGACCCATATATATGTGTCACACATGCACATATAACAGACACT 
TGAACACATATCATCTCCCATGTGTTACAGAAACTCATGAAACATAAATAATA 
TATGGATACGGAACATGTGTGCAAATATAGGTATATACATGACATACATGGA 
ATCACAAATACACACACATACATATGTATATATGTACACATAAACAGTACAT 

atggtaaacatgctattcaaatgtaatacagacacttgtacatgtaacacatg 
1 0 tatacagtatatgcac acaacatgtataagacacacctacaatgac agatat 
atacaagtaaccagtatttctcccacctagcacacatgtgtagacataggttt 
gtgatttttggaaaagtcacaagacagagccactcctggctatagagcagac 
ctgcaggttctgcccagtctgctcccctgtaccttgccgcttctgtgaagtacc 
acaacaaggcctccttactctacttcaaggaatgcctagaagttggggcaag 
1 5 ggtgagtgggaaacaagggaaagggctcttaataggtacccaccgctggga 
gtgcaaagaaggatatggcgaagacagagaaacaggaggcgat'ggtcttcc 
caacccacgtctgaggtaccttatccccgtagccaatggtagtgactgtgacc 
tgcaaggaagaaggactgcagtcagtggtgggctggctgagagcctcagaa 

AAGACACCTGCCAGGCCCAACACAGCGGCTTGATGACAAAAGAGACCAACTT 
20 AAAACCCTGCCTACCAGGATAGTGAGGCGCGGTTCTGAGAAAGTACCAGGAA 
CAGCAATATGGTGAGTTGGTTCAAGCTTCTGAGTCTCCTAAGTCCTTCAACTC 
TGATTACCACAGATGGCACAGCCCTAGGCTTACAGATAGATTGGGGTGTGCA 
GGACAGGCAAAGAGCTCAGAGCCCACCCACAGGGACCAGGCTTCCTGCCCCA 
TGGAAAAGATCCCTCTCCTGGCCACCAAAGTTGCAGTGCACTGGCCCCCAGG 
25 AGGCCGGAAGCTCCCACTGAGGCCAGCGGTCCACAAAGCCCCAGGATGGGGC 
TGACATGCTGCTTCCTGCCACCAGCCCAGGGATCCAGATGTTTTATTTTTCCTT 
TTCAGAGAGAATAAGGCAAGGCTCTAGCCTTCTTCTTAGTGGGGGAGCAGCG 
TCCAGGGTGTGCAGTGGGACAATAGGAGTGAGGGGCTGCCCCCCCCAGGCAG 
TCCCCTGCTGACATCTGAGGTTTGCTCCTGGGGCCTCTGATGAGT. jACTTGGG 

30 ggcccaccctggctgtgcctaactgctaggtcacaccatgaggctgccttggg 
cctgctgggagaacagggtggtcacattcactaacacgaagcccttggggtc 
acaacacactgtccagatcaggccatcagtgaaggcaggtcaqtgtgatggg 
ataccctgcccactcaaggtctttaaagtctgatcactctagacacctcagct 
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GTCACCCCTTGGGGCCAGGTGACAACCACCCAGCCAAGACCTATATGAGAAC 
CTCTAAGTGAACCTCTGCCCAACAAAGATGGGGCCTGGGGAACACCATCGAA 
TAGACTCCAACCAAAGTGAGGCCCAGAGAGGAAGCCTCAAGATCCCCACACT 
ATCCAAGCCCATGGCCATCACTACCCACTCCATTTCCAGATGGACTTCTGTGG 
5 ACCACAACTGTTCCCCAGTGGCCACTGAAGAGGAAGGGAAGGCAAGAGAGG 
CATGGGGGTCAGAAGTTAATATGTACATATCGTGAGCATAGAGATAAGTGCT 
CAGAGCCTAGGCAAGCACTCTGACCAACTTCAGCAGGGGAGCAGAGGACAA 
GGACAGCAAGAGTGACCCCACAGTGCCAAGTCTGTTGCTTTGGCCAGTATGTT 
AACAGGTCATGAAGCAACAGGTCAATTTTTCCTGGATGGCAAACAACAGAGC 
10 AGGCTACAGCTGGGCCCAGGCCATCACTGGGCCACCCTTCAGTGCTGTACCAT 
TAAATAAGAGGCTGCATGCATACTGGTGTTTTCTCCCAATTGCCCACATGAAA 
TCTTCCCCACAATGTCATTATCCTTAAAAAAATCCACACCCTACATGAACAGA 
GGACATAGCAAACCATCATGTGAAATCTTCCCCACAATGTCATTATCCTTAAA 
AAAATCCACACCCTACATGAACAGAGGACATAGCAAACCATCATGATGGCTG 
1 5 GGCCCTTT AAAGGG ACC AACCTGGTCTAC ATATGTGGGC ATAC AGTTGT AAA 
CATGCTTCACATGGAGACAAGGACACCCATTCACACTCATATGCACACGAGG 
GAACATGGGGCCACTGAGTGAGCTGAGGCACCATGAGGGTGCACATACACAC 
TGCGTAAGTTCACAAAGGGGTCTAGAGGTTATACTCAGGACTGACAGCCAGG 
GCACGTGCTGTCCAGACATCTTATACACACACCTGAAACCACTCACATGATGT 
■20 AAGTTGAAGACAAGCCATTCCAGGGACGGAGGACATGGCTGCCTGCAGGCTG 
TGGGATGAGAGGGCTCAGGAGAGCTGAAGCTCCTCACAAGTGACGTCTTCCC 
AGAGGGTGTGCACCCCACATGTGAGAGCTAACCCACAGCTGCCTTCTCTAGG 
GAGACAGGCACATGACAGGAGCTACTTGTGAAAGCAGACCTGGGCTGGAGCC 
AGCAAGAGTTCAAGATCACCTGGGATATATGGCTTATTTGAGGCTAGCCTTGG 
25 CTATATGAAACCCCATCCAAAAATGGGGGATGGGGAGGAAAACATCTACAAA 
GGTGGAGGGAGAATGGCCTAGAGGGCAAGCTGGGAAGAAGCATGGGTGAAT 
TGAGAGGAAACTCCTTATGTAGGTAAAAATACCTGAGGCCATGCCTTTTCCCT 
ATAGGCTTAGCTGTCCCTCCACTCCTTGGATCAGTAGGAGAAAGGGCCCAGT 
GCA^.TGGTCCTGAGTGCTGGTCAGTTGTCTGTTGGAAGGCACCTAACCACTAG 
30 GACCCTCTTrCTCCTGGGCCACCACAGAGCTCCGAATCTAGTTTATGAGTCCT 
CTCTCCCAAGTAGCCTGTCCCTATCAAGTTGTTrTTCCCCAAGTCACAGACTC 
AGCAACTCAGGAATGGATCATTTTATAGAGCTGTTTGGAACTTGTCCCCAATG 
GCTAGATCTGCAAGTCTCTTGGGCCAATGAACAGAGGGGACCTTTCAAGGGC 
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CCAACTAACAATAGGTCACCTGAGTCATACACACTTGAGTXCATAGATTGCTG 
TCCCAGTGACAGAACCTGTCGGGCAGGCTCTGTATAGTTCATGGCTCTCCTCG 
TCCCCCTGAACAGGCCCCTGGGAAGGATTTTCAGAACTAGTAGTACCTAGTCT 
CCTCCTTTTGAGTCTAGAGGTAGCCTGGCTGAGAATCTCCAACCACAACCATT 

5 CCTGAAGCTCACCTTCTGCATGCTCACTGACTTAACTTGTCTATATCTATTAAC 
TAGCTGTGCCCAAAGAGGCTAGTTGGCATGCCCATGGCTTCACAGCTGATGCC 
TGGTGGACATGGGAGGTGAGTACAGATTGGTCTCTGTGTTGACCTCACAGCGT 
GTCAACCTTGTACCTTATGCTTCACTACTTCCTGATGCTGTCTGCTTCCTCGTG 
GTGAAAGGGTTTCTGGGAAGCTCTATTCCTGAGATTTACACAGGCAACTCCTC 

1 0 AATAAC ATGATTTCAACCCATGTTTAGCACCACCTAACCTGCTGAGTCTGTAT 
AATAACTCTAGGCACTGCTTACTGAATAGCAGCTACAAAAAGATGGACATGC 
TTTGTAGAGACTCCTCATCATTCTATAAGGCCCAACTATAAAGATTCCTCCTC 
CTTGGGAAGAGCTTACrAGCAAGGAGTCTGGCAGGCCTTCAGCCTTTGCTGCA 
TAATTCTCATAGAACCAACACAGAAGAGGAGAGGGCTTAGTGCTACCAGCCC 

1 5 TGACCCCTGCTCTGGCCTTC ACTTCTCCATAGTCAGATCAATCCTGAAGAATA 
ACGTCTCTCGTGGATAGTTCCACGAGGCTTGCATGTGACTATCTAAGAGACTT 
CCTCAACTCCAAGGCCCTTCTGTCCTAGCTGTGCAGGATGCAGCCCAGCCTGG 
GAGGTGGGAACTTTCCAGAGCCTCCAGGTGAGGAATGAAGAGCCGTCAGACT 
GGGTTTTTTAAGGGACTGAAATACAGGGTGGGGTTTCAGCCTGCTCACTTGAA 

20 GTTCTGTATTTAGCCACTCAAGGTAAACAGAGCCCAGCAGCCGCCTGTTTTCA 
GAGACCCCCAGGCATTAAACAGAGGAGCCCATGTTTGCAGAAATGCCATTTT 
CCTGCACTCAAACTGTAATACAAACAAATGCATTCTGTTTCAAGGGTAAAGG 
AAAGCTTGAGGGGCTCCCCAAATACTTCCCAGCCCACACTTGGACATTATTTT 
AGGGGGTTCTGGGTTCCAGGGACCTCACACCCATGTGGGTATTCCCACTGGA 

25 GAAAGGGGATGCAGTCCATGAAGTTTGCTTTCCAGCTGTAGGAACTCACAAT 
CTATGAAGACAGGGCCACCTCACAAAATGCTGCCTGGATACCTACATTCTGG 
ACTTATCTGTGTCCCCCACATGGCTCACCCAGCCTCACCCCAGATTATGGCCT 
TCCTTCTGGGTCTCCTGGGTGGACCCTAATCCAATGCTGCATGTCCATGGGAA 
AC AC AT AGGAGCGAC AC A'-s. G ATCCT AGGGCTACC AG AAACTT AACTT AAT AC 

30 TAAAAGTCCCTTCCCAGAGTCTACCAAGTGAGTGTGGCCTGCCCATGCCTCGG 
CTTCAGACTCTGGTGTTCGGAGCAGAACAGGACATACTGAATCCACCTATTGG 
CAGTGAACAGGCTTTCCTGCCCCTTCTCACAATCTGGTTTACTTACTCAGTGA 
ATAAGCACTAGTCAGAGAGGACCCTCAGTTTCATCATAGGCTCTGCAGGCTA 
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ACAGGGTGAGAAACATTCCTCCCTATGCAGGCATGCCTCTATGCCCAATGCCT 
ACCTCAGTGTACCTTAGGGGAAGCTAGTCCAACTCTCAGGGACAACATTCTCT 
GGGGTAGGGGACAGGGCTTGAGGCCTAGTCCAGGACAGGGACATCAGACAG 
AGAGGGGCCTGGTGTTTAGAAAATGCTCTTGCCTGAGGGTATGCACATCACC 
. 5 CCCACATTAAACAGATGAGGAACCCAGGGACAAGGTCGGGAGGGGCCATAC. 
TTCCCTTATGGAATGAGGGAGACAAAGGAGATGCCTGTTCATTGGCTCAGGG 
CTACAGAACTCTGAGCCTGGCTGTCCTGTCCTACAGGGCTCAAGTTCTGATTA 
CCCTAGAGGTGCCAGGAACCCAGTCTTGGGCTGCTCCCACTAGACAGCTGCA 
GAGACAGCCACTGAATGAATGAGGGTACTGGGGGGCCCTTGGGCAGCCAAG 

10 GGCACAGGGCCTCACAGGAACAATGGCCAGGCCAGGGTGCGCTGTGGTCGAG 
GCCTGGGCTGCAGCTGACGGAGTAGGCCAGACAGAGCACCGGTGCCAAGCTG 
GGCACAAAGGGGCCGTGTTCCCGGCTGGCGGGCGGATGTTTGCACAGGTGCC 
GGAGTGGGGGCACCTTTCTGCCGAGGGCTGGGGGGCCAGGGGCTGGGGGGCT 
GGGAAATAGTTCCTGCAGGAGGCTGGGAACATAGGCAGGGCAGCATCAAGG 

15 CTGGGTACTGCTAAGGCTGGGCATCACTTATCAGCCACACTGTAGGTAGTCAC 
CCATTTACTTCCTGCTGTACCTCAGCTAACACTGCCCATCTCCCACCAGAGAA 
TTCTGGTTTGGAGTTGCCTGGCCCCCATGCAGGCCCTGCTACATCCCTGGGAT 
GTGTCCTCTTTCCACAGGCAAGCTTTCTAACTTTCCTACTGATTTTTCCTACCA 
ACCCTGTTTGGTTCTATCATGCTAGGGTATCGGGGGACATTGGCCCCAAGAAG 

20 GGTAGGCAGCATTTTGACCTGGGGCCTGTCCCAACCTGGAAGCTTCTCAGGGT 
ACCCTTACCTCCTGAGGCTGTTTAGGATGCTAGAAACTTCTAGAAAACGGACT 
GATTCGGCCAATGTCAAGAGGTCTTGCTATGTGCCCTGCTTCCTGCAGGCACT 
TGGGCATTGTGAGAACATGTGTTTATTCAGCTCCCATTCTCAACCCCAGTCTA 
GAAGCTCTGAGAGTAAGCAGGCCTCGGAGACTGCTCAGTCCTGGTCGGGGTG 

25 GGGGCAGTATTCAGGCCAGGGCCTGATGCAATCATCATCACCATGGATGGCT 
GCTTGTTCTGAATCAGGTTCCCAGAGTCTGTGCTGGCCCTTTCTATAGGATGC 
CACAGCCCACCCAACCTTCTCACCAGCCTAGTGAACCCCAAAGTCAACAGAC 
TGAGAATGCCATTTAGGATCGTATTGAAATCCTCGTGCCTACCCAGGGGTGCC 
AGCCCTGGCAGCTCCTCCCGGTATGATGGCATAAyi3CATCTGGACAGTCTATT 

30 GCTCTTGAAACTGAGGGGAGGAGATAGAGTTTTAATCGAGGGCAGACCATAC 
ATGCTCTGAACTGCTTGCTCCTGTTAGGTCCTTGTGTTTCCTGCAGCTTAGAGG 
GCCCCAAAGCTCAGTGGCTGGATAAAGCCTGCCTCCAGAGGAGGTCACGAAG 
GCCATGTAGCAGCTATCCCATAGGCTTTTAGGCCATGAGCAATGGTTGCTAGG 
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TCATGGAGTACACCTGGATTTCTTCAGCATCACTCTCACCAGAACCTCTCCCT 
CCTCTACCTGTTATCTGACACCCAAGGAGCCACATACCTCTACACACACACAC 
ACACACACACACACACACACACACACCACATTCACACCAAACACTGGCCTGC 
ACACAGTATCATCATACTTTGGGACATTCAGCCCTAGCACGAGAGGTCACTGT 

5 CACACACCAGACTCCACACAGACTGGCAGCTCAGTGAGTTTGGAGTTGCCCT 
AACCCGAGCGGTTTATGGCCATCTTTATGACCTTAGGGCCTGACTGGCACTCA 
TACTACTGCTGCCTGGGGCAGCGGAAGCCTTTGCAGACCTCATATCCAAAAG 
CCCCTGTGTCTTTCTCTTAGCTCACAAGGTTTTAATGGTAGTATGGCATTGTGT 
GCCTTGATCTCTGGCACAGGTGTGCTCCCTGGGTCAACATAGACACCACAAAC 

10 ATGCAAAGTATGTTTTTGACCCCTTCCTGCACCCTAAACCCAGGATCCCTCAG 
GGTAAGGATCACCACAAGGCTGAGATATGAAGTTCCAGGTTGCAATCCCTTG 
TCCCATCAGCCrfcCCTTCCAAAACAGAAGTTGAAAGATAACAATGCTCAAAT 
TTCAAGACAGGGACTGGAGAACGCTTACTGTAGTATACATTTGTAAGCTGTA 
AGTACACATTTGGTGGTTCAAAAGTTTTTAAAACTGGGGTTAGTAACCTCAGT 

15 GGAGAAGGTAAAAACATTGAGCAAGTAGAGTTCAATATCCTAACCCCACTGC 
CCAGATCCATGTATCTCCCACCTGAGACCCCCAGTAAAGACCCCAACTTACCA 
CCCCCCACCACAGAGCATCTGCGTAGCTGCCAAACTCGATGCGGCCGGACTC 
GTTCACCGCATCTTTCTCAGCCAAGTAGACAAAGTAGGAGGAGAAGATAAGG. 
CCCAGAAAGCCAATGTACAGGGTGGTGATCAGCTCCTAGGGACAGACAACAC 

20 AGAGGCCATGGTCACATACCCTGGTGGTCCACATGAGTATGGTGAGAGGAGG 
AAAGTTCCAGAATTTGCTGTTAGATTGTACTGTGTTGACTGTAGCATGTTATA 
CACACAGAATCACCTTCCAGAGTAACCTAGCTTTTGCAATTCACAAAAGTGGC 
AATGAAGAAGfCTCAGCTATTACCCTAAGTCAGCCATTTGGGTCCATGCAGTC 
TCCGTTGTACCCATGACAAGATAGTACTTAGAGCCAGCTGTCCCCAGCAGCCT 

25 CTCAGACAGAGTGTCCTCTGGAAAGTGTCTGAGGCACCACCCTTGATTTCCTA 
GGTTTGATTTCAGAGCTTCATAAATCTGTTGTTCATCTTACATTTTCTACAAGA 
TGCAATGGTCTTTTGTCCCAAGCCCAAGTGAAGGTGGCGGCAGCTCTGCACTG 
TrAAGAAGCCAAGTATTTTGCCATCATGAATGTGGGGTAGGCTCAGGCTGGG 
AAGGCTCTGGGGTCCTGGACCCTGTCCTCAGACTCACATGTCTTGGTATCi'GC 

30 CTAGACCTGGTCCACTCAGCTGGGGTCCAGGTACTAAGATATCACCATCACTG 
CATCCTGACCTCATAAGTCTCCAGACCTCGTGTTTATAGGCTGGCCAGCTTCT 
GTTCTGCCCACACCAGGCTTACTCACCTGGCGGTGAATGAAGACTACAGAGC 
CCAGGAGCCTCCAGGTACCCCCCTGGCGATCGACATGCAGCATCCGCAGGAT 
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CTGAAGGAAGCGGATACCCCTAAGAAGCAAGAAGGGAGAGGTCAAGAGGGC 
TGTGGTGGAGACAAGCATTTGCTTCCTATGTTCCTGTCTCCACCAAGCTTCCA 
ATGGGTAGAGAAGACATGTACATGACACTTTGTACTGGAGCCATCTTGCTTCC 
TATAGGTGCCTGAGAACCCTCTCTCCAGGAGCCCTTTTATATGGCCTTTTTGG 

5 GGGAGACCAGAAACCCAGGGTCACCCATATGGCAACCACACAGTCAGGCTAT 
CATCTAAGTCTTAGAAGCTAGGAAGTACTGGGGTCAAACCTTGAACTCAGGG 
TTGCATACTGTTTCAATATGATGAACTATGGCCACCCTTCCCCTGTGCCACTC 
ACCATGAAAGAGACAGGTAGAGGGAGGGAGAGACTACAGCTTAGGAGGAAA 
GGGCTTTGTAGTGGGTCTTCTCTCACAGGAAGAAATGGAAGAGCCAACATAC 

10 CTGATAGCTGATGTGGCGAACACTTGTCCTTTGGAACCCACGCAGAGGACAA 
CCATAGAGGCTACAACCACGATGAGGTCTGTGGGAGGGGCAGGGGGTCATGA 
CTTCTGGTAACCAGGAATCTGGGGAGGGTAGGCTACCACCCACACCTGAGGG 
TTATGCCACTAAAGCAAGATGGGACCCAATAACCAAGGGCACAGGAATAATC 
TAGGGGTCTCCATCCATGGAGATTCTCTGGGGATGACTGTAATGGGAATGTCC 

1 5 TTCTGGACCCATGAATGTGTGGAATGGAATGTCTGCAGGATGTGTAGGCAGG 
ACTCACCAATGATGGAAATGGGCTTCCGGGCAAAACGTAGCCGGCCCCAGAT 
GCCCACGTACTTGCTGCGGCAGCCTGCAGACCAGAGGCGGACCACATATTCT 
GTCCCAAAGAACACCACAAGGACAATCTCCTGAGAAAGAAAGCAATAAGCCT 
GTGTTAGAGGCTATGACCACTCAGGCTAGAGCACCTACACCAAGCTACCATG 

20 CCAGGAGCCTCTGCTGAGACACAACCGTGACTAGCAGGCTCAGGTATCCATC 
TGTGGTAACTCATAGTACCTATAACAACTGTCCAGAGAGGTTAACACACTGA 
GCCCAGGGATGCTAAGAGTAGACAGTGTCAGCTGTTATTTGCTGGGGAAGAC 
TCCACACCAGGGTTGGGGTTTAATGATCTGGTCTGGTAACATAGTGCTGACAT 
TTGTGTTGTCTAGCTTCATGGAATCAACAGAGCTCAAGTTCTATATCTGAAAA 

25 TGAGGGTTCAAGGACATGTGTCTTTTGAGGACTGTCTCCCTTAGAGCATAGCT 
CACAGCTCTGCCTCACACTGGAGACACCGTACCCACCATCCATAGCCTCTATT 
GAGGTTTGGGTACAAAAGTCCCCCCAATCCATCCAGAGGCAGTAGAGGAATC 
AAGCAAGGGCACGGGCAGCAACGGATGTGAAATAACTCTCCTAGGTCAATTC 
AC AC AGACC * . GACCTCTCCC AATGTATCC ACATAC ACTCTGAAGTC AGAGTG 

30 CACTGAAAGGGAGGCTTTACAAGCTAGAGAAACCCAGGCTGTCATCCAGGTT 
ACTTACTGCAGTGGATGGACAGATCAACACAAAGGTGGATGGATAGAAGAG 
GAGATGGGTGAGTGGACAAATGGATAGGCATGTGAGTGGATGGATAGATGG 
ATGAATGGAAGAATGGATGGATAGGTAGATGGATGGATGAATGAGAAGGTG 
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GGAGGATGGGAAAGTGGAGAGATGGACAGATGGATAGATQGACAGATGGAT 
GGATGGATGGATGGATGGATGGATGGATGGATGGACGGATGGATAGATGGA 
TGGGTGTGTCGATGGTTGGTTGAGTAGATGAATAGGAAGGTAAATAAATAAT 
TGCAGGTGACTGGTGTGAGAACCAAGTTAAAAAAAAAAAAAACAAAGTGGA 
5 TCCATTAGCTCTCTCAATAAAAACAGACTCAGCCTCTGACCATGATTTATAAT 
TCAGCTAGCAGCAATCTTGGGGCCTTCCCAGTCTGTGACAGCACTGGGTGAG 
AGCAACTGTACTGCTTTGTGTTCACACAGCCTTGACACAAACTGAAGGGACG 
GTCGCTGGCGGCCTGGCAATCACCATGAGAGAAGTCAAGGTAAGCCTGGGGC 
CTTGCCTTTGATTGGTGAATAGCCTCTCCTTCCCTAGCCTTCACTATCTGCCTC 
10 TTCAATGAAACACCCAAAGACCTGTCACATGCACAGTATCTGGAGGGTGAGG 
CATTGGCTGGCTTGCTATGGCATAGTGATGTCAGAAGGTCTGGTCTGGGATAC 
CCATACTCTGCTGCCAGTTTCCGTTCTCTCCCAGCTCTTGCCACAGGGTCATTG 
TCAGGGACTGGCTGCCCTGAGTGAGAATCCCCTTCCTGCTTGTTCCACCCCTG 
GGCAGCTCAGGTCTGATCCTCAGTCCTGCTATGGCGTGAAAATGTGAGTAGCC 
15 GCACCACAAGTGGGTGATGGGAGGATTTGAAATAAGTATACATTCCATGCGA 
ACTACAGCCTATGGTGGCCTCCAGGTCTCCAGCCCTGTCCCAGACCTGTTCCC 
CAGCTCCCAGCCAGCCTTCTTTGGTATGAGGCCCTTGGGGAAGGCCTGGGGTG 
AGGAGGTGAAACCACTCCCACATATGTTAGGGGACTGTGCAAAGCCACAAAC 
CCGAGAAGCCACAAAGCATTGTGCTGCTCAGGCCTGCTTCCAGCTGGCTCCCT 
20 GCCTGGCCTCCCTCAGGAGCCTGGCCCTGGAGGGAACTTCCTCTTTGAAGGGT 
GACACATGCTGGCCCTGCCCCCACCTGTCGGCTGCTGGAGAGTCCACCATGA 
GTGAGGGCTGAATACTGCACATCATATATAAATGCTTTTAATTAAGACAAAAT 
AGTACACCACCTGGCTTGTGTTTTGGGCATGAGCAGTGACTGATGGAAGGAG 
TGAGCTTGGGAGATCATAGCTCCTTCCTTGTGAGAAAAGCATTTGGCCCATGA 
25' AGTGAACTCATTGGGAATGGGGGGGGGGTGGCTATGTGTGGGAGAGCGAGA 
GAGTGCTGGCAATGATGGGCCCAGTTTGGCTGAGCATACCAGGGAGGAGCCA 
TAGTCTCGTTCACTCCAGTCAGAGCTCAATATTGGCCTGTTGAATGAATGAAT 
GATGAGTGAATGAATGAACGGGCAAATGAACAATCTCTGAGCAGCTGGCTTG 
CAGGGATATTGATTCATCTTTGTT^CTTGAGAATCTGGGTCTGTAGGGCTGTT 
30 GCCCAAAGTACTGTGTGTATCTTTGGCCAACATGAAAAACATACTCCCAATCC 
CCAGCACAGGATATGACTTCATGGGAGACAGCATGCCTGACCATGAGAAGTC 
CCAATATTTAGTTACAGTGCTAGCCCAGGCAGAAGGCCTCTGTTCCAATTTCA 
TGGCCATACATCACCCAGCTGGCTGATCATCTAGTGAGTTTCTCTCATTTGCA 
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GATGTTTGTCATTATTCAAATATCTCCTTAGCAATGGCTTCTGATGGCTACAG 
GTGCCCCCTGACCTAGTACTTGGAGGGAAACCCATTTTGGAATTGGCTATGTG 
AAACAGCAAGATCCCCTTACACAGATCTCCAATTCAGCATGTGAACACAGTG 
TGAAGCCTGGATTCAGCGGACTTGAGCCCCTTCATCCCCTGAAAATGTCTGGC 
5 TAGTTATGTCCAGGGCAGGCCAGGCTGAGTTAATCAGTAAGGCTCAGGGGCT 
TCTAGGAAAGAGACATAGGAAAACAACCCTGTCCTAGGTAGAATGTAGCCAT 
ATCTAGTGTTTAGACAGAGATAAAATGGTTTTCACCAGCTTTGCACATATACT 
GGACCTAGGAGGAGTAGCCCTCAGACCAAAGGTCACACTGGAACTCCAGTGA 
AGAATTAGCATTTAGCAGTAGTGTGAGTCTGTAGGAGACACACTGTGGGGGA 
1 0 CAGTTTAATTCAGGTAACAACACCCTAGTCCATAGCAGGGTTAGTGGGGCGA 
GCTTAGCCCTTGTATAACAGACCACAGGGTTGATACCAGCTGTTGTTCCAGCA 
TAGTCACCCCTGCTCAGTCAGGGCCAGGCCTGAAGCTCACCTGTCTTATAGAC 
ATAATCCAAATGCAACCCCAGTGTTGAATGATAGGCCTCAAATTTCTCAACAG 
AACACATAAATACCTGTTACCCCTCCCTCACCCCCATGCCTGCTCCTGCTGTC 
15 CACTTGAGGTCAAAGGTTATGGGCTATCACCCAAAGGGTCTCATGGCCCAGT 
GCACCCAAAAGCCTCCCTAGACCCCAGACACATTGCATCCTCTGGGTTCTCCA 
TGTCCTCAGAGGCTTAGTGTGTGGGGTGTACTCTCACCCCCACCTTCCAACCC 
CTTGCCTATTTAGACCATCCCAGCATCCCCCAGACCTTGTAACAGCCACCTAA 
AAAGCTAGCATGCTCCCCCCACCAATAAAAGTACCAGCCTAATGCACCAGTA 
20 TTGTCCAGCTCCCTAAAGGGCCTCCCTGAGAAATCCACATGCTGCTGCCAGAA 
CCAAGGACTCAAGGCCAGGGGACATTGATGGAGTGTGGCCCTGAGCTCTGTC 
CCAGAGATATAAAAGAAAGGGGTGTCCTTGGGCCAATGAGCTTGGGCTGAGA 
CAAAAGGCAAGAGAACACAGTGGCCCCTCCCACCATGACCATTGTTCCCTGT 
ACTAGAGTTAGGCTAAATCCTGGCTGCAGAGGGCTTTGTGGCTGAGACTCCTG 
25 CGTCAGGGATAAACAGCGAGCACAGAGCCTGCAGCTTTTCCTGTATCCACCA 
AGGCAGGAACCCTGTGCCTCTTTCTATACACAGGGAGGAAAGCCCATGAGAA 
TAGGAGAATACCTGCCCAAGGATTCATGGGAGGAATAGACTGACATGGGCAC 
AAGGCAGAGGATCACACAGGCTCTCAGGTTTGGGAGTGGACTCTGGCAAGTG 
AGGCTAGTTQCCTCTTGGAGGGCTGACTACTGATACGTG, tGTACATCTGGGA 
30 ACATGAGGACAGAAGAGGTTTGGGGAATGGCTGTGAGTAGGGGTTGTATAGA 
TGAGACAGGGTCTTGAGTGATCAGGGACAGAAACCAAGGTGTAAAAGGAAA 
' TGGCCTAGGATGGAGGGTGACCTAGAGGCATAAGAGCCATTTACCAGGCCAG 
CTGTGACACGAAGTGTCCAGGGTAAGGACAAAAGATGAGCCTGGACCATGTG 
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CAGGTCCTGGTACCCCATCCTGGTGAACTTCTGTGCATGGCAGGAGAGCTAGC 
ATTTATGCAGGTGAGTCTGGGATTATAGTGGGTGTGAAGAGCCCTGGGGAGA 
TACAATAATCCTGGCTCTAACATACGTGTGCTTGTCTCCTCTAGGTCCTTGAA 
GAGCTGGAGGAGAGCAGAGGGGCCAGAGTCTGAGTGCTGGCCACTCCTCCTG 
AAGCTGTCAGAGAAAAAAAGAGGGGCTTGTCTAATGGCATACGAGTGCCAGA 
GCCATGAACAAGAGGTTCGGACAGGCTCTGAAAGCAACAATCTCTAGAGTGT 
ACCCCTTGCAGCATCCTGTAGCCTGTCACTTTGCCCATGTGCTGGTCCCAACC 
CACATGATATACTCTCTGACACACTCTGCTTATTTCCTATGCAGTAAATGTTG 
AGAGTTGTATCTCAATGGAGTGAGAGTCCCATAATTGAAAACAATTTTAAGG 
GCTGGCGAGGTGGCTCAGTGGGTAAGAGCACTGACCACTCTTCCGAAGGTCC 
TGTGTTCGAATCCAAGCAACCACATGGTGGCTCACAACCATTCATAATGAGAT 
CTGCCACCCTCTTCTGGTGCATCTGAAGCCAGCTACAGTGTACTTATGTATAA 
TAAGAAATAAATCTTTGGGCTGGAGCAAGCAGGGACTAAATGAGCAGAGTTG 
ACCGGAGTGAGCGGGGTTGGCTGGAACAAGCAGGGGTGCTAAAAAATTCAAT 
TCCCAACAACCACATGAAGGCTCACAACCATCTGTACAGCTACAATATACTC 
ACATACATAAAATAAATAAATAAGTCTTTTTTAAAAACTATTTTAAAAAGCAA 
AGCTGCTGGCACTGAGGCTCAGTTGATAATGTGCTTGCCTAGCATGCATGAAG 
CCCTGGGTTATATTCTCAGAACTGTATAAACCAGGTGTGGTACTAGATACCTG. 
GACTCCCAGCACTAGGAGGATAGAGACAGGAAGATCAGAAGTTCAGGGTCAT 
CCCTGGCTATGTAGAAAATTTGAATCCAGCCTAGATTAATGAGACCCTGTTTT 
AAAATAAAAATACCGAAGAAGGAGGAAGAGGAGGAGAGGAGGAGGAAGAG 
GAGGGGAGGAGGAGAGGAAGAGGAGGAGGAGGACAAATGTTTTAGAAAGGT 
AGAGAACCTGAATAGGCAGCAATTTGGAGAAGACATACAACTGAGAAAAGA 
TGCCCACAGAAACAGTTATTCTATGCCACTAACTGTGGAAATACAAATCAAA 
ACAGAGGCTGCTGTATCCCTCTAGCAAAACACAGGCAATGACTACAGTGGCC 
AAGGAGGCAGAGAGGCAGGGCTATTATACATGGCTGCCTGAGGGAGACACC 
AGACCCATGATCCTGGAAGCATGGTTCAATGGTTCCCTAAGTCATGAAGAAT 
GGAGTTACCATCCACTCAATTCTGCTCTAAACAAGGATCTATCTTTAAAAAAA 
AAAAGTACAGTTGTGGGACTGGGGATATAAAGTATATAAAGTATATCCCCA 
GTCCCACAACTGTAGAGCACTTGTCTGGTGTGCCTGAGGCCCTGTTTAACCCC 
CAGCATTACACAGAAATACATCAATAAAAACAGAAAGATCATTTATTTTTCTA 
AAAACAAAAAGAACTTCCACAGAGCCCAGCAATTCTACTACATCCTGAAAGA 
GAGGCAGGGTTTTGAGAAGCTATCTGTCCACCGTGTTTACTTTGGCAAAGTAC 



WO 02/054069 



17 



PCT/US01/49808 



ACAATAGCTAAAGGCAATGTCCATCAAGAGGACAGAGAAAACAGGCTCCAG 
ATACTAGAGGGCACATAATATTCCTTCAGAAGGATGTATGCTCCAACACAAC 

ACAGCATGGACAGACCTTGGGAACCTTCTGTTGAGACAAAAGTCCTGAAAGA 

TCCCCCTCATAGTATAACTGCAGGGCCCCAAGTTACCAAAGCCACAGGGACA 

CAAAGCAAGATGAGGTGGCAGGGCATGGGGGTGGCGGGGTGGGGGTGAGTG 

TTTAACAAGACCAGAGTCATCTGTGAAGATGGAAAGTTCTCAAGGTACGTGG 

TAAGGACTACTACAGTGTGAATGAGCTTCATGCCACTCAGCTGAGCACTCAG 

ACATAGCCAAGGTGAGACACCTTACCTCATACATATTTCAGGAAGGGGAGTG 

GGGACAAGATGGCTCAGCAAGGAAAGGCACCTGCTATTGTCAAGCCTATTGG 

CTCAAGTTTGATCCCCAACCCCACACGATGGAAGAACAAAACTGACCAATGC 

AAATTGTTTTCTGACTACCACGTGCACACACAGACACACACACACACACACA 

CACACACACGTACCTGAGAAATAAGGGAATACTTGTAATAAAAGAAGAAAG 

AGACAGGAGGGGGAAGATGACATGGAGTCACAAGGCTCACAGGATCTACGG 

CCTGCAAAGGACCAAGTGGACCCTCACTGTGGCTCCCATTGTAAAGATAAGA 

AAGACGGTGGTAGAGCAAAGACAGGAAGCATCCAAGGTCTAATTAGGTCCTT 

TGTCATCTCATCTCACTATGGCAATGCTGGAACTCACTGAGGCACAGGCTCGG 

TGGCATCCTGGGCTCTGGATCCTGGCAGGGACATCTGGGAAGAGACACGCTG 

CCTCTGAAAACTTCACAGTGGGAACTGAAAGTTTTCTCATTTGACAGGTGGAC 

TCAGTACCCCCAAACTGAACCCAAGTGATTCAGGGGATGCAGAAGACAAAAG 

CTTGGCATGACAGAATCTCTGACTGGCCGGGAACACCTATCACACATGCATA 

CCTACTACACATGTGCCTCAGGTGCTTGGGGGTAAGCACAGCAAAGCAGAGA 

TCTGCCCGTATGTGAACCTAAAGGGCCGAGATACAAGTAAGGGAGATGCTCT 

ATCTGTCAACAGAGGAAAGGCCAGTGCAGAGTTCACAGCCTCCGAGGCCCTT 

CCTTGAAGGCTGTGGGTGTCTTATCTATTCTTTCTTTTTATCAGCTGGTGAATC 

CCATTTCTGGAAGGGTGGGGGTTCAGCTTACTACAATCCACCTGTGGCCCACA 

GCAAAGCCAGGGCAGGGCAGACTGCCTTGGATCCTCAGATGAAAGACACTTA 

TTTTTTTTCCCCCTCTGTTTCTTTTTGTTTAACAATCAAGGGAGCTC 

CTTATCGTGAGTATTCAGTGTCTGTGATGTTACTATTTTATAATCITCrTAATA 

AAAGAATAATTTGA/'VTCTGACTGGAAGATTTAACTATTTCAGATGAAGGGT 

TGGCCCATGGGAATAGGGGCCCTGCAAGAGAGCCGGCAAATCCAGGCCATCA 

GGCCAAAGGGCCCGGCTGGTCTCCCTGAGGCCCTCTAAACACTGGACAGGGC 

AGTTTCTAGAGAGTTAGATACCTCTAGGCCTCCTGTAGGCTACCAATCAGCAT 

AGGGGTGCCTTGGGAAGCTAGGGCTCAGACCCTTCCCTCCTCTCATAACTAGG 
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AAACCTGTAAGATGGCCATTTCCTGAGCCAGGGGGCTTCCTGTGGGAGGACA 
GTGCCTTCTCAGCACACAGACAGCCAGCAGACTTTGGGGCCAGCTGTTCTGAC 
TCTACACGGGCCCCAATAAGCCATAGAGAGGGCCAGCCTTGCATTCTCGGGA 
GGGGCAGCTGGCCGGAGGCTCGGTGCCGGCATGTAAGCTGCTCAAAGCACCC 
5 TTTGTTCAGGTCCTGCCTGTGGCCAGCTGCTCCCAGCCGCCAGCCTATGCTGA 
GAGCCCATGGGCCCCCCCAGTGACATGTGCTGGGCACAGCTGGAACCCTTAG 
GCCATGCTCACTGCCCACCCTGGGGATTGTCAGCTCCCATAGGCCACCAGCTA 
GTCCCAGGTGACACAAAGCCTTCAGAGACTCTAAGGAGGCACAGAGTCAGGG 
CCTAATGGCAGTAGACAGAACCAGGACCCTGTAGAGACATGATCCTTTCTAT 
10 AACAGTCCACTCTTCTCCAAATGTCCTAGGAGTCATGGGTACCTACTACTGGT 
GGGGTTCTGGGGGCACACAGCACAACACAGCCAGGAAGCAGGGCATTTCCAC 
CAGCCACCATGAAAGTTAGTGAGCTATAAGATAGAGCTAGAGGCAGAAGCCC 
CAGTCCTGGGAGTGGGCAGCAGAGTCCCACACACGCCTTCTTCAATAGCCAG 
TTCCTAATGGCACTGGGCACAGATCAGCAGCTGCTATGTCCACACTTATTAGG 
1 5 GTAGACCCAATTTGGGGGCAACTTTAGGAGATGCAGGGTTGCTTTCCCCTGAG 
AACCCATCCCAACTGCCAGCTGTCCAGGCAGCCATTCTCCACAGTGGGCAGA 
AGAGGCGATGAGGCTTGCTCAGTCACCATGGGGACAAGCTTTCCCATCCACC 
GGGTAGCCAGGCTCTGAAAGTGGGCAACAACTGTCACCTGGTCCTCTAATCA 
TAGACCCTACTTCAGCAGGTGGCATGAGGCCCAGTGTCCACTGCTGAAGTTTA 
20 CAGGGGACCAAAAGCAGAGGGAGTGCCTTGAAGACTAATTGTTTCTCCTGAT 
CATGTTCTAAATCCATACTGCCCCATGTAAGACTCCTAAAGGGTGCTAAGGCC 
CATGTCTCCCTTTCCTATATGATCTAGAGGCCACCTTCAGGCCAAAGCTCAGC 
CACATAGGGTTCCCTGGTATCATCCCTGACCCTCACATTATCATGTTACAGGT 
ATAGAGGCTACCACAACAGGATGTTGGACATCCCGACAGACTGTCTCTGAGA 
25 GCTTCCTAACAAGCTGGGATGTCCAACAATCTTAGGAAACAATAAAAGTATA 
ATCGGGTTGCCCAGGAGGTACCTCTTGAGCACACTGTGCCCCCAGGTCAAGG 
CrCAAGACCTTCTGCCCCAACCAAATTCCCAAAGCCCTACTAATCACTTACTG 
ACACTAGGTCTAAACAGGAGGCAGATGGGGCAGAGTGAGGATAAAGCAAGG 
AGGCTTGGCCCACACTAGTGGGCTAGCCCA^TCTTGGGCCCATTATCATTAGG 
30 AAAAATCCTCCTCAGATTGGAGGCTTCTAGGCCTCAACCTTGGAATAAAGGCT 
ACCCACCCACAGTGGGAGGAGAGCAGGCCCTGTGGTCACTTTAATTTACATG 
TTTCGCTGGTGCAGGGCTGTGTGGTACTGACTAGAGTTACATGCGGTTCCGGT 
GGTTTAATGAGCTGCCTCCAGCTCTGAGACCCAGCTTGCTGCAAGCTGGCAAG 
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GTGGGGGTCTTCATACCCTGCCCACTCCCTTCCTCCCTGCCTGGGTGTTCTAGA 
GGTCAGGAGGTGAACATGGGGGTTCAGTGGCCTCCCAATGCCTGGTAGGGGG 
CTGGGCAGATGGCGCCAGGATATGCACATAGACCAGCCCAGGAAATCCCTGC 
CCAGAATTGCCTCATGAGCAGATTCAAACCAAGCAAATTCATGCTATCCCTTA 
5 GACGCCTCCAGTTGTGAGCCAGTGCCTGACAGAGAGAAGGCAGCCTGGGTGC 
TCTGTAAGTGTGTGCCTGCATAGGACAGGGGGCTATGATTTTTTATTTCATGA 
ATTTCAAACTAGGTGGAGGTCACAAACACCAGCCAGCAACTGAGAAGCATGG 
TGACAGGCAGGGGTGGAAATGAAGTATCTGCAACACTGGGGAAAGGGGGCA 
GTCAGAGCCTCACTGGGGGCAGGACGGATAGTGTCTTTTCATTAACAGAAGT 
1 0 C AGCTCTACATAGCTGTTCACAGCTGCCCTGTCTTCCCTCACCCTGTTGAGTTT 
CCCCAGCTATTTAGGAGAGTACCTCTCTATTCCTGCTAAGACTTCTTTTTCCCT 
CTCAATATTTTAA'CCATTTGTTTCCATGTTTCGAGTTTTATTCATTCTTTTCTCC 
CCTTTTTAAAATTAATTAATTTATTTATTTTTGTTAT^ 

TGGGGCAGAGTCTCATGGAGCCTGGGCTGACCTCAAACTCTATGTAGCCAAG 
15 GATAACCTTGACCTTCCAGTGCTCCTCTCTCTCTTCATCTCCCAGCACTAGGAT 
CACAGGTGTGTACACTGTGCACAGTTTATACAGACTGGAGATTGAACCCAGG 
GATTCATGTATGCCAGGCAAACACTAGCAGCCCTTGCAGTAAGTTCGGATCA 
GAGAACACCCACATCTGTCTCAATAGTGTAAGGCATGCATGTCCTTTCTTTTA 
ATAGGAGCTCCATGGGTGATCGGCTTCAGAAGTCTTCCATGGGTCCTCTAAAG 
20 AGACAGAGAATACCCACTATCAGGCTCCAAAATCACAATGATAGCTTGAAAA 

cctgctccccatacaccttgccactcctgtcctggaatctgggctttaagatt 

AAGAGAGTCAGTCTCAGTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTC 

TCTCTCTCTCTTTCTCTCTCTCTCTAACCCAGCCCCCCTACATACACTACACAA 

TTGCTCCACCCCTGATCTTGTGGACATTTTTTTCCCTTTCTGGAATGTTCATTTC 

25 TAAAGTAGAGCAACCCAACAGGCATAACAGCCCTAGAGGAGGGTTTCCCCAC 
ACTGTTGCCCCTGTGCTAAGTAGAGAGGTGCCAGTCCCAGAGCTGTGCAGCCT 
ACAGGCTTAGAAGCTGTGCTCTGGGTTTGTAGGTGGGTTTTTGTTGGGGTAGC 
ATGGCTAGGGATAATAGCAACAGTCGTTTCAAGGTCCTGTTACAGATTCTGAA 
CC ACTTACCTGCCT ATGTTCTCTGACCCTGC ATCTAACTC AAAGA( r r ATTCTGA 

30 AAGAATTGCCTAATGGCATCTGTGATGCTTAGGGTTGTCAGCCTGACAAAATC 
TAGAATCACCTTAGAGTCTGGGCTCAGAGTATACCTGTGGGGGCCTCTCTTGA 
TTATGTTTGTTGATGTGGAGAGACCTAGGTTGTCTAGGTGGGAACAGCGAGC 
AGGACAATAGCAAGTACTTCTTCATTGTTTGCTTCTTCTGATGTGACATAGTG 
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TGATCGGATGTTTCAGGCTCCTGCTGCCTTGACTTGCCTACCTCAATGGACCT 
GTGAGGCAAGATAAACCCITTCTCCTTTGTCAGGGTGTttAATCATAACAACA 
GGAAAAGCTAATCCCTCATAGTCACCATGGtTAACCCTCITTTCCACATACCA 
TGTGAtTTTTCACAAGTGTTATGATGTATGCTGTCTTCATATTAATAACTAGGC 

5 TTCGTCCACACAAACTCAACCTGCTGTGGCCTGCAAGTCTTGTTCTCTATTTTT 
GCGTGCAATTTCTTTGCATAGTTTTGGCTATCCTTTTTTTTTTTCAA 
TATGTCTTCTTTACTAAATGTACTGGACACACCTGCTTTTTGTTAAAAAAAAA 
AATCATAGTGGGGCCATAGGGTCACCAGTAATATGCTTATCTGGCATGCACA 
AGGCCTTGAGTTATATCCCTACTAGCACAAAATTAATTC'rill TAAAAGAATA 

10 AAACCCTATATATTAGTCCTCTACTTTTTGTGAGATTTTGCATATCTTCTTTTG 
GAAGATGAATTTAGGACATTAATATTTAATATAAAATTGAATACACCTGACTA 
CATTTCCATCCTCACTTGTCATATGGTTTCTTTCCTTGACCCCCTTTCCTGGGA 
GCTGCTACGTTGGTCAGCACCACCCTCCCTCCTTATCTCATGGAAGAGGACAT 
TTTACTGTTCTCTTCCATTCTACCAGACCCTTAAATACATCTCTTATCTAAAAA 

15 AACTTTCCATAGCACTGATTTCTTGAGATTTTCTCTGGCAAGCCAGCAATCTG 
TGTATGGCTCTATTTCTTTTGACCTCCATCATCAAGGGCTCTGTGGAGAAAGG 
CACAGTTTAAAACATCTTGTCAAAAAAGGATGGACCATCTAGAGACTGCCAT . 
ATCCAGAGATCCATCCCATAATATGCCTCCAAACGATGACACCATTGCATATA 
CTAGCAAGATTTTGCTGAAAGGACCCTGATATAGATGTCTCTTGTGAGACTAG 

20 GCCGGGGCCTAGCAAACACATAAGTAGATGCTCACAGTCAGCTACTGGATGG 
AGCACAGGAGGAGCTAGAGAAAGTATCCAAGGAGCTAAAGAGATCTGCAAC 
CCTGTAGGTGCAACAACATTATGAACTAACCAGTACCCCAGAGCTCTTAACTC 
TAGCTGCATATGTATCAAAAGATGGCCTAGTCGGCCATCACTGGAAAGAGAG 
GCCCATTGGACAGGCAAACTTTATATGCCCCAGTACAGGGGAATGCCAGGGC 

25- CAAAAAATGGGAATGGGTGGGTAGGGAAGTGTGTGTGGGGGGGAGGGTATG 
GGGGACTTTTGGGATAGCATTCTAAATGTAATTGAGGAAAATATGTAATAAT 
AAAAAATATTTAAAAAAAAAAAAAAGAACGTCTTGACAAAGGCGAGGCAAT ' 
TGATGCCTAAGGCTCTGTGGCTCTCATGGCCTCCACCCTGAAGTGGAAGCGAC 
AAG AG ' AGTACATCCACAAG AATTTATGCTATGTACTCATAAAACCTGATTT 

30 GTATAGACATACAGGCAGAGAGGGCCAAGTTCACAGACTTCCCGGTTCCTeA 
TCTTCTGGTGGGTGAGCGTTATCAGGTAAATGCCATCAACACAAATTCCTGTC 
AATATCCCACCCTCAGGATGTCAGTATGTGACCGTATTTAGAAACAGGGTCTT 
TTGCAGATGTAATTTATGATGAAGCAACAATAGCATGCAGTGTGCTCCTCACC 
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CAACATGACTGACAGCTTTACAGCAAAGGGACAGTAAAATCCAAAAACAATG 

GGAGAAGGGAAACCAAGTGCGTATGAGGGCAGATGTGTTTTTTACAATGTTT 

TTGTAACAGAACATCAAGGATTGCCATCTTCCAGAGACCAGTCCGGGAATGG 

AATGGGTTCTTCTCACGGCTGGAAGGAACCTACCCTTATCTTTGACTTGCATG 

CTAGAAAGAGTGGCCCCTAATCAGATGTAAGCTGTCACTCAGACCATCTCAG 

AAGCTGTATCTCTAGAGATGTCTCTTCCATCAGGCCAGAAATGACACCAAATC 

ATCTTTCTCCTTGTAGGCATATTGGCCCACTGCTTCCTCACACCACAGTGAAA 

TGTGGGCTTTGAAGAACAAACCCAAGGCACTGAAGGCTAGCAAGGCCCCAGG 

TGTACTGCTCTGTCCAGACAGAAGGGGAGCTCTTAACCCCTATCCAGTCCCAC 

TCTGCATAGACCCCTGAGAATGCCTTTGCATATAGGATCAACCCCAGTTGTCC 

ACCCTGAAATCTGTGCTAATTTGCGGGCAGTGGCCCAATATTGCTCATACCTG 

AACTCCTAGTCATCTTCACATGAATTTAAAGCAGGTCTCTACCTCTGCCAGGA 

AAGTTATGCCCATCTCTAACCCAGTTACGTGGGGTTTTACTCTTAGATCCTGA 

CTTCACTACCAATGGCTGCCAACCCCCATGGAGAGACATCCAAAGATGCGTG 

CCTTAACTGAGTCTTGGATGCATTTTCATGTGTAGGGAGCGCACACTTCTAAA 

ACTACCTCGATCAAGAATTTCCACAGGACTAGGAGTTTAGTGAAGCTGAGAC 

CCTCAGGGTGAGCACCCTTCACCATAGCTGTGGGCCTAGGCAAGCCAGTTGA 

GCCACTGCCTGAATTCTATCTTTGTGGGTGTCTAGGGAGTGAACCTATTCAGG 

GAGGCCTGAGACCCACAGGGCATGGGGATGAGTGGACCACCTGACCACAGGT 

TGTTCITTGGCTTTTTGAGCATGTGGTGGGGACACAGTGCTTGTACCTGGCTT 

ATTCATTGTCCGTTGATTTGCACATAGTCATTGTGCAAATTTAGACATCTTAAT 

GAGGGGAGAATTTAAACAGTACTTCTTCTTCCTGTCATGAATGTTTTAAGGGA 

CTGAGGAAATTGTGCACGAATTTGCTGTTTTAGGAGATGGTAAGTCAACAACT 

TCGGGGGAAGTATACTACTTCAGGGCATATGTTTACACAGTTCCTGGGAGATG 

TAAAAGTGACCTACCAAGGATTCTGAGTGCTCATCTTTTCATCCTTGAGTTTC 

ATTeCTTGGAGAAGGGGCGACTTCCTGGAGAGTCCAGCAAAGGTCCACTCTC 

TGCCCTTTCCCCCGTGTCCTGCCAAACAGGGTGCCTGCAAGACTCTTGGATGG 

TAGACAGCAGAGAGCATGGGGATAGGAAAGGAGCAACTCTGATGCCATAGTT 

AAGCCTGAGTCCTCTCAGAG'.TCTGTGGGTGGGACTCTGAAGAGGATGTCAT 

GTACACACACACACACACACACACACACACACCAAAGTTAAGCTTAGAAACA 

CCCCCAAGATGGACTTTTTGTGTTTTTAAGATGGGGTATTGTTATATATAGCC 

CTGGCTGGCCTCTCAGTCGGTATAAAGCCCAGCTTGGCCTAGAACCCATGGCT 

GTCTTCCTGACTCAGTCTCCTAAGAACTATGTGCTACCACACCCAGCTTCAAG 
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ATGGCCTTGTCTTTAAACTCAGGATGAAGTTCCTTTGCTCTTTGGGGATTCGAT 
GCATTCCAGGATGAGAGTGGCCAAAATTTGGCCAAGGAGAAATTCAAAATTC 
ATGGGTCCAATGTTTACtAATTTATGTCAAAAAAAAAACAAAAAAAAAAAAA 
AACAAAAAGCAAAACCTTTCCTGCTAGGAAAAAAATGCAGATGAAATTAATT 
5 CGTGGGTCTGTCTAATGTGGAAAAGAGAAATAGAAGACATTTTATACAAACC 
TGAGTGAAGGAGCAGGGGGTACCTCCCTCACCGACTCACCCTCGGGTACCAA 
AGCAGAGCACACGGCAGGACAATTAGGACTGCTCCAACGTGAGCATAGACGT 
AAATACTTGACTTAGAATCTCGGTGATGTATGTTTATAGGCATTGTGATGGAC 
ATGGCTTTGTGTGACAAATGCGAGGGGTTGTTTTGTTCTTTACAGATCAATGA 
10 GCACAACTCATAGTCTTAGAGAGGAAAAGGTTCAACTATCTCAACAGGAAAG 
GCATTGGGTACCGCCCAACAGTAACTGATACAGTCCCCAGGCCCTGGAGACT 
AGTAAAAAACAAAGAACACACCCTGATTACCATCACCAGCAACCTCACACCC 
GTCAGTTTTTGTGCTATCAGACACCAGCCCTAAGGACCACTTGGCCCCTCTCT 
GCCAAACTGAATGTCTGGTGGAAGGTCCCCAACTGGTGCCATGAGGCAAAAC 
15 CAAGAAATAAAAGAGATGTGGACCAAAAAGGAAAAGTGTCAATGGTTTTATT 
TAGGTTGACCAGACATCTCCATAGAGAACCCTAAAGAACCTACAGCGAGCAA 
ACAAAGCAAGTGTGTAGTAGCTTCCCCTCTCCTAGTCATAAGTAAGGTGAAC 
ACAAGAAACAACCGCTATCCTATAGCCGAACAACAACCCCCAAAATACATTG 
AGGACAGCAGTGTCCTGTAAGGAGCTTAAAACCACAAAGGATAAAGAAGCA 
20 GCCTAATGGGGTGGGGCCCTGGAGATTGCATGGCACTCACAACCTGCACTGA 
AGATTCTTCACCAGGTAGAGAGAGAAAGCACAGTGCAGTTCAGATGGACATG 
AAACCCATGCATCTTTACCCCTCCCTCCCACAATGCCAGTCAAAATGCCAAGA 
GGGCTGGGTATTGGTTTTGATTTGGGATTGTTGTTAAAAACTGACAGCCAGAT 
TCTCTGATTTTTCCTGAGATTGTGAAGGGCCCAGCAGAGCATCACACTTTGAA 
25 AATGAACCAAATTGGAGGATTAACAAGGCCTGACTTCAAGGCTTAGAGTAGA 
AGTCACAGCAATCAAGGCTGTGCTGGCACAACAGCTTACAAAGTCCAGAAAT 
TGGAGCCAGTGCATGGCCAACTGATTTTCATCAAAGGCGCCAAAGTCATCTC 
ATGAAAAAAAAAAAAAAGTCACAGGGACATGAGGGTAACCACATGCAAAAG 
AG ACTTAGTCCCGATTCGTACCTGTCTCCAGGCAT/: AGATTAAACTTGGAAGG 
30 AATrACAAATCTAAACAAGAAACATGAAACGATAAAGTTTCTACAGAAAACA 
GGAGAAAGGCTTTGTAAGCTCACATTAGGCAAATATTTCTTAGACAGGACAC 
AAAACACATGAATCTTACAGGAAAAGGGGGCATGGAAAACACCCTGATGAA 
ATGAGTTCATCAAAATTAAAAACrGCTTTCTGAACGATCTTGCTAAGAAGAAT 
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CAAAGACGAGACCAGGGGAGGAAAGACCTGAGCAACACAGATCCCCGTGAA 
AGACAGAGTCATGGACACCATCTCCTCAATTAGCACAGAAGAGACATAGAAT 
CATTAAAACAAACAAACCAACAGAAAAGCTACGGTCCTCTGCAAGAGCATCC 
AGTGCTCTTAACCACTGAGCCATCTGTCCAGCACCCAAAGCAGACAATTCTCA 
5 GTGCTAGTGAGAACCCACAGTTACCATGCTGGCACCTAGGATGGTGATAGCA 
GTCAGGAGCAGCCATCTTGCCAAAGGGCTTGGCAGCTTCTGAGCATGCAGTA 
CTCCCACTCAGGGACAGGCAGGGAATAACCAGCGAGCAATAGAGGCAAGTA 
ACAACTCCAGAGCCTGTGTGGAGTGCTTGTGGCAGCTTTAATCATAATAACCA 
AGCCCTAGAGACGGCTCAGATGTCCATCAGCCTGTGGACGGGTAAACAAGCT 
10 GCAGTACACTTTATCCTTTCCAAATGAGAGAAGAATCAACCAGAACACAGAC \ 
CAGAAGCAGACGGGCCTCAAGTGAGTCTGCAAAGTGATGAGAGCCAGACCCC 
GGGCTTCATGCTGCAGAACTGCGTTCTCAGGGGGACAGGACAGTTCCCAGGT 
GCAGGTGCAGGGATGGGCTGTGGTACAGGGCGTACAGAGCGTGTGAAGGAA 
CTCAACTTGGCCTAGGAATTTGCCAGAACTCACTGAACCGGACATCTCACATG 
15 AGAGAGAGTTrrGCTGCACATAACTTAAACCCTACAAAGCTAAAACCTAAGA 
CAAGAAAACAGAAAATAAATAAGAGAAGCCTAGGAGATGGGTGCATCACTG 
GTTACTTTTAAAATGTTGAAAGAGGAGCTGGCAAGATGGCTCGGTGGGTAAA 
GACATTTGTCGCACAGGCCTGGGACCCCGAGTTGGATCCTTGAATTCCATTAA 
AAGGTAGAAGGAGAAAACCAATTCTACAAAGTCTTCCTCTAATCTCTGTGGG 
20 TACAACATGGCACGCATGCCCATATGTATGCATAGACAGGCACACACAATAA 
TAGTAAGTAAAATTCTTTAAAATGCCTAGAGTTTAAGTGCTTAAAAAGAGAA 
ACAAGCTAACATACTGGAGACTTTAAAAGGAACCCCACCACCACCACCACAT 
TAGGTTGGCTGTAGACACTCATCTGTCACAGCTCGTGAGCACTGTCAGTGCTG 
TCCCCAGCATGGCCACTAACCTTCCTCCAGCCCCGTCCCTCAGCCTCTGCGAA 
25 TCCAGCCCAGCTGTGACTCGTTCAGCCATTTTCCTTGGAGGAGACAGCCAAAT 
GACATTTTATGTTGGGCTAACTTTACTGGTCTCTGCACTTGTCACAAACTCTCA 
AGTGCAGCTGGGCCACAGTCATGTCTCTGGGCCTGGTACCCAAGACATGGGT 
GCCGCAGACTAAAGGGGGTGAATATCACCCTGTATCTGTATTTTGGTCCCCAG 
CTGAGTATGCACTATGCACTATGCAAGCTGTACAGCAGCCAAAAGAGCT" AA 
30 TAAGGATCACAAAGAGTTCCGACACTCTGGTATCTGCTGCTTTGGCCTCTCCT 
AGGCCCTTCTAGTTAATCCTCCAGTGACACTGAAATGGAGACCAGGAGGTGA 
TAGCAAGAGTGGAGAACCCACAAAATGCAAAAACTACAACCTGTGTTGTGGT 
TCCTCCCTTGTTGAAGTCAAGCTGGGCCAGGAAGTCTGTCCTCCTGTAGGACA 
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GGGCTGCTTGGAGGCCAAAATCCAGAGCCAACCAGGGTACAGGGTAAAGCTA 
GCTTCACTGCCAGCTGTGTCCCAGCCTGATACCTCTGAACCTTGATAGCTTAC 
ATAGGGTGGCCCCTACTCTAACATTGCTGAGGGTATCAGGAACCATGACTCCT 
CCTCACTTCTAAGGTCTGTCCCACCCAGGCATGCCCACTGCCTTCCCTAATAT 
5 CTGTGGGCTCAATACCTCCCAGTGACTATCCTAGGGCTCTGCTTGGGAACACC 
AGAGTCCCTGTGACTCTGCAGAGCATACACTGGGTGTCAGGCCACCAGCCCA 
CATGGTTCCACATTCTACATGGTAGGATGCACAGGGTCACATGGGTATATGGC 
ACCATATGCAGCTACTTACATCCATGCCCCATAAGACACCCCAAGGCCTATCA 
GGCCCCAGGCAGCTGGAATGCAGCAGTGCACACAGGGCCTCTTTTTTCTCCCC 

10 AAGTACTGAGATGGAGGCCCACCCTCCTGGCAGTCACATTGGCATGGTTACT 
GGAAGTATCCCAAGACCTCCGGACTTAGACTGGTAGGCACTTTGCCTCCATCC 
TGTCTCTGTGAGCCCCAAGGGTCATAGATCACCTCAAATTTTCAGGTGGAAAA 
CCAGGACTCCAAAGTGCTCCCAAAGGCAGGCAAGAGTAGAAAAAAGATGGC 
AGACTGAAAAACAGATCCAGCCAACAAGAGATCCAGGTGGCCAACCACCAG 

1 5 GTGTCCAGGCTACGG AC ACCTATCGTGGAGACAACATCTCAACACGAAAGCA 
ATCTGCCTCCCTGTGATCTGACACCACAGGCTCACGGTACCCAACAAGATGCC 
AGGGCTCGATAATGGTCACTACAATGTGGAAGACCAAGGGGTTCAAAATGAA 
TATGGCTGCTGGAATCAGCCCTTCCTTGGGAGAGGAGCTCTGACACCTAGCCT 
GGAGACAGCCCTGCGATTAGGACACCTAGTTCTTTCTGGGTACCCAGTACCTG 

20 TCACGGAACATCCTCCAAAGTGCTGGTGTTTGAGCCATAACAGCTCACAGGA 
GTTGGGGTATGGGCAAGGAACAGAGCCCCTTTGTTCCCTCCAAATATGTCCTC 
ACTTTACCCCAAGTTGAGAGTGGCACTTCCACATCAGGTGGGCCAGCTCTCTC 
TCCCTCTCCCCTCCCTCCCCATCCCTGTCTCTCTGTATGTGTTCACATGTGTGT 
ACATCCCAAGTTCATTAGCCTATGTCCAGAGCCCAGAGCTCTTATGTGTGTGC 

25 AAGCAATCTCTTCTCCTCTCTCTCTCTCTCTCTCTCTCACACACACACACACAC 
ACACACACACACACACACACACACACACACACACAGAGTCAAAGAGTGATG 
GAATGAGACACTCCCCCTTCAAGGGGATCATTGTATCCAGGCTCCTCCTTCAT 
ATACCTGCAAACTCTGATTAAGAAAAAATTTCCCCAAGGTATTGGCGAACTCT 
GAAGATGTC1 ? CCAGCGCCTGGCC ATTGGCAAATGTGTCCCCTTGGGGTACCC 

30 CAACTACATCTACAAGTCTTAAAAAGAATGAGAATCCAAGCAGTTCCAGAGA 
CATTTTGGTTACTTAGGAATTTTCTTTGTTTGACCTGTGATGGTCTGAGTAAGT 
ATAAGAATGCAGtCGGTTACATGACAGATTTATGTATGCATGTGTCCCACTCC 
ATCTCTGATTTCCTCCACACAGCCAAATTTGGTTCTGATGGCGAATGTAAACC 
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TGTCAGAATCAGTGCAAGCCTTCTCTGGATCCCTTGAGTCCAGCCTGGTCCAG 
GTGTGTGCCTGCATGATGCATGTAGGCAGTGACCTTCGGGACAGTGATGTTCA 
TAAAATAAACAAACAATCAAAAAAAAAAAAAAACCCTGTGCCATGCCACTTA 
AATTCTACTTACAACTTTCTCATGGAGAGTAGTTGGGTGCAGATGGCATCAGG 
5 ATAAAGCTCACTAAATACAAGAAAATTATCCTATGACAACCACACACAAGGT 
AGGGCATGTGCATGAACACACACTAAAGAAATAAATCTAAATTTTTAGTAAA 
TAAAGAAATTAAAGATCAAGATGTGTTCATGTATTACCAAGATGCTAATTTCC 
CCCAAGCTGACCTCCAGATTTGTATCCCCAATAAAACTCACTCTGCTCTTTCA 
TAAAAATGAGCAAGCTATTCCTAAAAGTTATATGAACAGGCAAAAGACCCAG 
10 AAATAGCCAAAACACTTAAAGAATAAAACTGAACGTCACGAATGACTGTTTT 
TGCAGCCCTGTCTCAATGCTAGTAAGCACAGAGTTTGGCATTTCAGGATGACA 
GGTCAGTGACTCAGAACAGTCTGGCAGATTCATTCTTGTGGCACACTGGACTT 
TAAAAGGGATGGCAGGAGACTCAGGAAGTAAAGAATATTCGTTCCAGAAAAT 
GCTACTCAACCAATTCAGCTTCTACATAAAGACACATAGACTGCCAGCCACAT 
15 TTAGTACAAGGACATGGAAAGGACTCTGGAGTTCTAGTGCCTATGCAAGCAT 
GAGGGGCTAAGTTCAAGTTCAGGTCCCCAGCTCCCAGGCATGGCTACATGTG 
AACTCTGACCCCAGGACTAACAGACTGGTGGATCCAGTGGCCCTCTAGGTGG 
ACTGTTTAGCTCCACGTGCAATGAGAGACAGAACATAGAGAACAATGGGGGA 
GGGTTCTAACATCTGTCTACCTCTGGCTCCACAAACACCAGAATGGGTGAGTA 
20 CCCCATCACGTGTACACACACACACACACACACACACACACACACACACACA 
CACACAAGCACACACTCACATGCACACACACACTCACGTCACATCACAAATC 
TAAAACTATAGAACGACAATAAGAAAATATTTATGTCACACTTTATATATATA 
TGATTACTAAGTATGAATTAAAAGGTAATTAAAACAACAGAAGATATAAATT 
TGACCTCACAAAATTGAAAATTTCTITTTGGGGGTGATAGGAATGAAATCTGA 
25 GGCCTTGTGCACAGTAGGCAAGTGCTCTAGAACTGGGTCACACTCCAAGACC 
CAAACATTGTTTTTTTGTTTGTTGTTTGTTTTTAAACCACACT 
GAGAAGGCAAGCCACAGTATGGAAGAAGGTGGAAATTTCTAAGGTCAGGCCT 
GGCTTCAGCTCCAGGCTCTGGGAACAACCGCTACGTCCAGGGAACTGTCTGTT 
GTCAATATGAAGCAATGAGCAGGG-'wTCAAGCCTCATCCGTGAGCAGGAAGAG 
30 ACACAGCCATGGTAAAGAACGCAATGTAGTTCAGTTGATAGTGTGTCTGCCT 
AGCTTGTAGGAGGCCCTGGGTTTAATCTCCAACAGCACATAAACTGGATGtG 
GCAAAGTAGGGCTGTAATCCCAGCACTGATGAGCTGAGGGCAGGGGGATCAA 
CTCAAGATCATTTTCAGCCTCAGAGGGAGCTAGGGAATAGAACCACGTTAAG 
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AAATGAAAGAGAGGGGAAGAGATGAAGTGAGGAAGGAAQGAAGGAAGGAA 
GGAAGGAAGGAAGGATGGAAGGATGGATGGAAGGAAGGAAGGAAGGAAGG 
AAGGAAGGAAGGAAGGAAGGAAGGATGGATGGAAGGAAGGATGGATGGAT 

ggatggatggatggatggatggatggatggaaggatggatggatggatgga 

5 tggatggatggatggatggaaggatggatggatggatggaaggatggatgg 
atggaaggaaggatggatggatggatggaaggatggatggatggaaggatg 
gatggatggatggatggatggatggatggatggatggaaggatggatggat 
aaatttaatgtgaaagaaccccccaggcctccagagagctccacagcactca 
gacactgaccagctcccactctgcagaattaggctcCtaatccacgtgtgaca 

io gttctcatccaggctcctcgtgctgatagttcctgaattcaatgtgcaaatgg 
aacgggccccaagggcaggggtgtcccactcccggcaggctcggatccaagc 
acagcattcaaactgcataagtgcgtgactccaaagtatgtgcagataagag 
aagtgcacacaggtcatgcacatgtgtgccttctcacacatctgtgcatgtat 
gtgtggatgtgtatgccccaccccctacccccctgaacgtgcatatcacaact 

15 cagcgtcccagctgtgctcggctgccttitcttccaacctgctatttaacactc 
ctgagcaagtgtggaagccattgttactactcacagttcccagcctcttttgc 
agaccacaagtgctgggtccacctgcactcagcccccagagccacatctccc 
gagaagagtctgtgttataaatacccacgagaacacaaagctttgacaccag 
aggccagggcttgctttctcaagcatcgcccacataggggaggagagcacgg 

20 cccctggttgccacgaccctgaaggtacttgaggacatactcgatttctcaag 
cccagcagtccagacacactgagatggcctctgggtcagagcacctcccaca 
gctattgaagaggctgccatagcccagccccagccccagccccagcccttgg 

CCCCCAACCCCCAGTCCAAAAACACCACtTGCTCCACCCAGAGACCCCAGTC 
GCCATGCAGCCTTGTCACCAGGAGACAGACAGTCTACAGGAAGCTGCTGCTG 

25 GAGCCTTCCCTCACCGGTCAGCTCAAAAAAGCAAGGGGCCAAGGGGCCTGGG 
TGCTCAAGTGTGCCTCTCGACCCTGAGAGGCTGCCCCCAGCCCCCTCCCTCCC 
TCCCACCTGCGAAGTCACACAATGACATGCTTCAATAGCACACAGCCCGGGC ' 
CTCTGGTTTTCGTTCTGAGTGACCCTGGGAGGCCAAGAACGTGATGGGAGAC 
CCGTCTGAGTGTGATGCCCTGGGGGACACTCAGGCCCeCAGCCCTCGCCACTT 

30 AGGCAGCCCGCTCCTCCAGGCAGACAGCTGGCGGACTCCACCGAGCTGTGCC 
CCTCAGCCATGAGGCCCTGCGGCTCATCCTCTTACCCCCAACGCAGCCCGCAA 
GCAGCCCCCAGGTGCCCCACTGCAGAGGGGGTGCCAGGGCAGCCTACTCAGC 
CCACCCCACTATATTATGTCAGCTGTTAGCCTGCACTTAAAGCGAAGTGCTCC 
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AGCTGGCCAGGGTCAGATGTAAAAACAAGGGCCTAACAAAACCCGGCCACCC 
CCCCCCCCAGCTACTCTGAACTGGAGACCCCCACCGGATGCAGCACTCACTGA 
GGAGACTCTGTCCAGCCTTGGGCAGAGCATAAAGGAAACACCTGTCCCTGGC 
CGGCACACTCTAATCCCACTAAACCCGAGTGGAGAACCACCCCCAGCAGGCA 
GCTCCCACCCCAGGTCTGACTTTTGTGCTGGGCCTAAAGGAAGGATGGTGGA 
GAAGGAAACCCTCTGATCTCAGGAATGGAAGCCTCCTTGGGCCCCAGCCAGA 
TCCAGGTCACTGAAAGCACAGTGCCTGATGACAGTACCCCCCTTACCCTCACC 
CCAACACACACCCCTAGTTCCTGGAGTCTGAATAGGAACCAGCCTGCCTTGG 
GGTCCTTGGCTCCTGAACGGTTGTCCTCCTTTAGAAGGACAGGAGGCTCCTGC 
CACAGAAAGACCTTTCCTACCGACACGGGAGCCCCATGTAACCCTGCCATAA 
GCCAGGACCCTCAAATAACCCCTGGGAGCCCCACACTTGGCCACCCAGGCTG 
ATTTATCACTACTTACCAACCTGGAGCTGAGTCAGCACAATTGTTAGAAAAGG 
GCAGGGGGAGTAGGGGGTGAAGAGAGGGGAGTATAAGACCACCCAGATACC 
TGGAGGCCCTGGGAGGGGGCCAGGAAACTATGTACACACTACACATGAGTCA 
CGCGCGCGCGCGTGTGTGTGTGTGTGTGTGTGTGTGTGT.GTGTGTGCTCGCGC 
ACAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGCATTAT 
CCTGATGCTGTCTCAGTCTCCCCTTGCAGATCTCCTAACTGAATGGTACAGGG 
AATGGTAAGCTCCCAGGAAGGTAGATCTTGGGGCTAAAACAGAACTGTAGAT 
TTGGGGTGGGGTGGAGGCGGCGGGGGGGGGGGGTAAAAGGGTCGCTCCCAT 
TGATTCTGTCCCCATGACAAGATGTAGTACAGGAGTGGAAAGTGACCACACG 
CCCACAAGGGAACAGTGAGAGCTAGTGCCAAGGGGTCTTCTTGAGGCCATGA . 
GACCCTGCTGCCAAGCACATGCTTCTCTTCCCCTTTGGGCTCAGAGTCACCAG 
GAAAGGGTGTGGAGAAGAGCCTGGAGGGAGAGAAACAGGAAGCAGGCCCTG 
CAAACCCAGGTAGGCCATGGCTTTGCACCTACAGAGCCAACCACAGGCAGGC 
25 CCTGCCTGCATTCGCGCCACCCTGCCTGGAAGCCTCCAACTAATCCCCACATG 
CCAGGTAGCGAGGAATGGGTGGCATCCTTCTAGAAGGTTCCCACCATGTCTTA 
CTGTCCTCTCCTTCTGCCCCAGGCTCCTCAGGATTTTATCCGGTTCTGAACAAA 
TGTGCTACGTGACGATTCTTGCCACTGCACACGGCTCGCTCTGGTTTCACAGA 
'fGATGCAGACACCAGAGCTATGTCCTGCAAGACCCTCCACGCTAAGGCCTAG 
30 GAGGCTCAGGAAATCTGGTGGCAGCTGAGGATGCTGGCTTTCCTGCTCTTACA 
GTGTGCAGGCCTTGTAAAAGGAAGCCGCTGCCTCCAGGGCAGGGTGATGGCC 
- AGTTTAAACCCCATAAACCTGAGGACCTAGGATCCATGGCTCCTAAACTGTGT 
CCACTGTGATAGAGAAGGCCTGACCTGCCATTTCCACCCTCCCAGATATGTCC 
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CATGCATCCTACTTTCAAAAATAAAAGCAATAAAAGGAAAGGAAGAGAAAA 
GCTGTATTTCAGTTTACAATTCCAAGTTACAGTCCATTGTTGCGGGGAAGTCC 
AAGTGGGAACTTCAGTCAGAGTCGAGAGCAGAGTGCAAAGAATGCATACTTG 
CTTGCTTGCTTGCTTGTGCTCAGCTTCAGTTCTCCCCTACCAGTCAGACCAAGA 

5 GGTCCTACCATCCACAGTGGGCTTCCCACATCAATTAACTTAGTTAAGACAAT 
TACCCCACAAACACACCCACAGATCAGCCCAATGCAGACCATCCCCCACTGG 
GAGTCTTTCCAAGCAATCCTAGGTTTGTTGAGTTGACCATTAAAGCCAACTAT 
TACAGTAACGAGGGAGAGCGCCTTCTTCTAGGAGCATACACTGACTTTCTAA 
AGATAAGGGACAGGGACATAGTCCCAGCATCTTTGGAAAGCCCCACAAGATG 

10 TATTTAAAGAGAGAGAAAAGTGACACAGATGGAGTAGAGAAAACCACCTAG 
TGGCCCCAGGCAGAGTCATCTGAAGTGGCATGAACTAAGAACAAGTGTCGTG 
TGTGTCAACATGGAAATACAGAAGAAAGACACTGTGAAACCAAAGTATCCAC 
AAGCCATCCTGAGCCGCAAAGACCAG'TGTGGGACAGTGCAGCACACCCATGT 
GGTCCAGACACATTTGCAACTTAAGTGCCATTTTAGTGTCCATGGATCATTAG 

15 AAACCAAGAGGGTCAGGAGTTCAAGGCCATTCAAAGTTGCCAGATGTGGTGG 
CACATGCCTCCAATCTCAGCACTAGGGAGGCAGGGGCAGGTAGAGCTCTTTG 
AGTTTGGAGCCACTCTGAACTACACAAGGTGTTGTCTCAGAACGACAACAAC 
AACA^TAAAATAAAGCATACCAGCAACCACCATTCTGCACATTCTCCTTCTGA 
ACAGAATGAGGATTGGACAACTTTCTCGAAGGCCATAAAACACGTGCTGTAG 

20 GTTTTAAGGGCCACACAGTCTCTATCCTGGTTTGGAAAACAACCAACCAAACC 
ACAGAGGAGTCCACAAGCCTGCTGGCTATGTGCTACCATGCCCTTCTTAGCCC 
CTAGCAGGCAGGGTTTCAGCACTCCAGCCAACCCCTACATTCTCTAGCAGCTG 
AAATGGAGAAGATATTGGGATCCGCCCACCGTTCCTGCCCCAGCTCATGGGC 
ACCATAACAGCATGTGAAAGATGCTGGGCAGGTAGGAGGCAAAAGGCTTTGC 

25 TGCGCCCCAACATAGCAGGCATTTCTAGCAATCCAGCAGGACATATAAGCTC 
TGTGCCCAACCAGTGGGACTTTTTTTGCACAGCTTGGAGATCTTGAGGTCCTT 
AGAAGCAGCCCAGCCCCATCCATTCACCCATGTCACCCACCCATCCCCAGACT 
AGGTGTGACATCCTCCCACCTAGACTAAGCCACAGAGCAGCTGTCTTCCTGGA 
GAGAGGCACCAACAGTTCGGGAG^?TACCCCCAAGTCCCCCAAGCCTCTGGA 

30 TGCCCTGGGGGCTACAAGCAGAGCTGAGCCCATGTCTGTTCCATTATCCCCAA 
CAGCCCTCCAGGGACTACACGCAGATGGCATGTGGCTTCCATGTCTACAGGA 
CCCCAAGGTGTGAGTTCTGGGTAGGAACTTGAAGCCCTTGTCCCATTAGCACA 
CAGCCTGAAACACAGTTGGAGCTTCACAGCTACTCTCTTCCTCAATGAATGTA 



BNSOOCID: <WO _ 02054069A1_I_> 



WO 02/054069 



29 



PCT/USOl/49808 



GGAGAAAACAACTCAATCAAAAAAGGTGGGCATCTCGTGGGGAATATCTTTT 
AAAGTGGCAAAATGACAGCTGGGGCAATAGGAACCTTGTGTGCTGAGGACGT 
AGAAAGTTCTGGTCCAGGTGGGCAGTGATGGAGCACACCTTTAATCCCAGCA 
• CTTGGGAGGCAGAGGCAGGCGAATTTCTGAGTTTGAGGCCAGCCTGGTCTAC 
5 AAAGTGAGTTCCAGGACAGCCGGGGGTATACAGAGAAACCCTGTTTCGAAAA 
ACAAAAAACCAAAAACCAAAAACCAAAAAAAAAAAAAAAAAGAAAGAAAG 
AAAGTAAGTTCTAGTCCTGAGAGGGACACAATGTTTCCTCAGAGGACCGAGG 
AGCCCAGGAGAGAGCTGGCAACACTACTGGGTACACGGTTAGCTGGACAGCA 
TCTCTAAAGAGCCCTGAGAACTAGATCTCTCCAAGCAGGGGCTGGGGCCCCA 
10 CAGGAAGTCAGATCTTGGGCTTGGTGGTGAAAACTCAAGCTCCCTGCTGGTCT 
CCTGACCTTGTGCTGCTGCAGCAGTGACTCAACCTGCAGAAGTCTTCTTCCCT 
CGGCTGCAAACTAGTCTTCAGGGACAGGGATAGAGTAAGCATACTGACTCTC 
ACGAGTGCAGCTAACAGGTGTGATATTAATGCCCCCATAAGAGATCTCAGAC 
AGAACAACAGACCAGGCAGTGGGACACATGGCCAAATACAGTGCAAGCAGC 
15 AACGGTACACTAGGTACAACCCAAGGCGTGTCTCAGCCAAAAAAACAGAACA 
CCCGCTGTGACTCCAAGCAACAGAAGATGGAGAGCTAAGAGCAAGCTCTACA 
ACAGATAGACCGTGAAAACAAACTAAACCAAAGCAAAACCTAGACACAGCA 
CCAAGACATGAAGCTAGGCGCTGAATTTAACGCCAATAAAGTCTCCACTGGC 
AGGAAAGGGTGTGAAAACGAATGCAAGAACATGTGAGGCCCTGCTGAGTGT 
20 GAACGAGGGTCTGGGACCTGGGGAGCTGACCCTGTTCCACTTGAAAGTAGGA 
' CCCAGCAAGGGCTGTTTTAAGGGCTCAGAGAGCCCACCGGATCTCATTTTGCC 
CCTGGGAAACTGAGGCCTGCCAAGAGAGTTACAGAAAGCAAGGGCGGGCCA 
GGAAGGGTCCAGCTGCAGGGAGGTGGGTCGCCCCTGCGGGGCCAGGGAGCCT 
CACAGTAAGGGAGGAAACTGGAGAGATGACCTCATCCCCCAGGCCTCCGGCC 
25 GCGCGGCTCACGGGTTTTAACCGCAGGCATGGCGGGACAGTTCCCTGGTATG 
CGGGCTGTGGGGCTCACGGAGCTGGAGTGTGTGCTAGGACTTTTTCAATGCA 
GCTGTCTTTGTGTGGGATTTGAAAACAAGCTGGGGTTTGAGAGACAAAAAAC 
AGCCCTGCACCTGGGCCTGGCCAGGGTCTCCTATTGCAAGGAGGGGTGGGTG 
GGTTTGGCGGCGTGCATGCACATTCAGCAGACTTGCAC(^rACACGGCGTGCA 
30 CACACATGTGCGCCACGTCTACAGATGTACATACAGCTTGCTCTAGATTCAGA 
GGCCGGATTGGGGGCTGGGCGGCGGGAGGGGGGGTGTCCTCCTTGGGGCCAC 
AAACAAGTTTCAAGGCCAAGCACACCTATAACACCCCTGATCAAAGTTCTAG 
CCAGACCCTGGCCTGGCAGGGCCCAGTTTTCTCCAAGAACAGCITTTCAGACC 
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CACAGAATATGCCTGGCTCAGCAGAATGCTTAATCCCTCGGAGGCCGCCTGG 
CTTCCTTGGCCCTGGCTGAGATAACAGGGCTCTGGCACCACGAAAGCCAGGC 
GCTGAGGCAGGAGTAGCGGCGCTGGGGTTGGGGAACACAGCCTCGCGTGACT 
TCTTCTCCACCATTTACCCAATGTGCTCCCAGAATCCCTCACAGAGCTCCCTTG 
5 CCCTGGGCCCCCGTGGGGACCGCGGGCTGTGGTTTGGGCTCCCGCGCTCGGC 
ACTCTGGGCTCGGCCCAAGGGCTGCTGGAATGTGTGTTACAAGAGCGCCACC 
TGGTGTCTCCACAGCTCGTCAGAGGCTGC.CCTGTCCTTAGCTCAAAGGCTGAA 
ATAAACATCTTTGTTTTTCACCCAACTCCCGTCCACAGCATGTAGCAGCTGCA 
TTCTACACTATGTCCCCCAGCCTGAGGCTCCAACAGCCATTCTCACCCTAGAG 
10 GCCTGGAATCCACAACCATGATATCTGCGTGGGCCTAGGACACATGCACCTTT 
CGGAGGCCATGAGCCATCTGCAACAGCACACAACTCCTCCTTACCTAACCCTT 
TGACCAATCATCATGTCTTCTAGGAAAATATCACCACCCTATATACAAGGTCC 
TTTGAGCCATACACTTCTTCACTGGTCCATATGTCCAGGCTGGACACACGCCA 
GCCTCACATCTGCTTCCCCAACACCATGGTCATGACTCTGAAGTAAGCCACTA 
15 GACACAGCCAGCTCAAATCCTGGAAGGTATGGCCTCTAAAGGCTAGCATGTG 
AGGATACAGCTTACGAACATGGCAAAGTTGGGGACAGACAATGGCTCAGCAG 
TTAAGATCCCTTCTACGCTGGCAGAGGTTCTAAGTTTGGTTGGGTACGCATGT 
TGGGCTGTTCACAATCGTCTGTTACTCCATCTCTAGGGGATCTGCCACATTAT 
CTTCTGGATTCTGTGGGCATTTCGAGCACACACACACACAGGAGGCGGCAGG 
20 GAGAGAGAGGCTTTAAAATAATGTGGCAAAGCCAAATGGGGAGGTTAAGAC 
TAAAGCCAACTATACTTTCATCCCTATACTTTACAGTTGGAGACACTGAGGTT 
CAGGGGTATCATTGTATCCTTAGGGGGCATGCCTGGGAACAAGACTTAAAAA 
GAAGGGGGTTCTGTATCCTACTCCATATATACACTCTGAGCAGCCGGGGTGTT 
GAGCCTCCCTCTGCATAGTAGAAAAGTGTGAAAGAGATGTGAGCCTACTCAA 
25 AGAAGCCCATCATCACATCCAGGTTGCCCAGAAAAGAGCAGAGGGGAGTTCC 
AGAAGGGAGTGTGCAGCGTTGACCGGAGGTCAGTCCTAATAGTAGGAAAGCA 
ACCCTGCTGGATATATATAGTGTTTGTGCAAAGGTCCTGAGGCAGTTGGGAG 
GGAGAGATGGGGAGGGAAGGGAGGTTTGGAGATGGGGGGGGGTGTATGCTT 
: AAATGTGTGACACTCTGAAGACCATGAGGGAGAAATTGGAAATTATCCCAAA 
30 CCACCCAGGGAGCGTCTGGGACACTCAGGTTTTTGTCCTATAAAATCTCTTCT 
TAGCTGTTGTGGGGCAGGGAGGGTCCAGGGAACGAGAAGGGGACAAAAGAG 
GTAATGAGGTCACTGCCCTCACAGTGGTGGACAAAGAGCCACTTCTGGGAAG 
CAGAACAGACTGGGGAGGGGCTGCCTTGGGAGTGGGGACCGCTCACACTGAG 
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GGAATGGAGGAGGGGTGGGTCCCACCCACGCCTGCCTTTGCTTTAGGGTTGTT 

GGATCTGTGTTCACAATTCCAAAAGTAGCCACAAGATGGCAAGCAGGCCCCA 

GAAAGAGCCTGGCCTGAGGGAGGCAGCAGAAGCCACCAGGAAAGGGAGCCT 

AAGCCAAAAATTACTGGGGTAACCACCCCAAGTCTGGAGCTACCAGATGAGA 

AATGGAGTAGGTGCATTCACAGGCCCACACACATGTGAACATGTGTTGACAC 

ATGAACATACAGTACCATCTACACGTAGGCATGAACATAGCTACACATGAAG 

ATACACAATACAATGCATTCATTCACAAGTGCACAAGAATACACAAAACCAA 

GC AC AAGGCCTAATATGG ATATTAC AC ATGT AAATGTCC ACTTACACTCTC AC 

ACATGCTCAGACAAAACACACACTTACCTGAAGACACAGCTATATATAGGAA 

TGTGTCCATTTGAATACATGGGTCTGTGCATATGTGAATACATTCACGTAAGT 

GCATATATAGACTTGTACTTGGGAGGCCTCCCAGGGGAATGTTCTGACACAC 

AGGCCCTTCTCAGCATAGGAGCCATTTTGGCCCAGGCTCTCCAGACCAAGTCC 

CAAACTAAGAACTACAGAAACCTAGTTCTCTGGAAAAGAACCTGGGCCATTC 

CCCTCAGCAGTAACCAGGAGCTATGTCCTCTCATAACATATCCAGAGACTTGA 

CCTCATCCGTGTAGCAGCCACCTCTGTACTGGGCAAGGGAACAATTCTGACA 

GGAACCTGTGGGGATAGCATCACACATAAAACTAAGGAACCCAGGGCCCTCT 

TCCCCACCTTGGTTCTATAAGGGAATACAAGGGCCATGTGAAATTCCCCAACC 

ATATCTAAGAGCTCCATCAGACAGGTCTTGAAAGATTATACTCTGGGACTAA 

GGGGTGTCATCCAGGAAACTGAGGCTAATCAGCTAACACATCACAGAGAGAT 

GCCAGCTAAGATTTGGTGGCATCTTTCCAGCCACATGGAACCAACTAGGGAA 

TAGGCAGACTGGCTGGATGTTGCTTCTGGAAAAACTTGAGGCCCTGACCCCG 

ACTTGAAGCCCAGGAACCATAGGTACAGGAATTGGCACGTCTGTTAAGTGTG 

GGGACATCATAGGCTAAGGCCCACATTGAAAAGTCCCAGAAAACTTCAGAAG 

AGAAGTCGCCCTCTAGTGACCACAACTTCCAATCACAGCTTGTTACTGTCTGT 

TAGCAGCTGCCTAAACCACATGTGCCTAAACCACAGTGACATCCCAACCCCA 

GAGGTAGGCACACTGCTGGCTGGCACCACAGCCATCATAGTTAATACTTGGT 

ACCAGGTGTGCCCTGGATTGAATGCATTGACTTTTGGGATGGGAGTAAGGAA 

CAAGAAATTTGCCAGGTGCCCCTGAATTTTGCTACAGAGCTCACTGTAGGGTA 

TGAACCCTCCAGTGGCCTGTAGAGAACTAGGTACTACCAGGCAAGCCTCATG 

tttgatttggtctaca^tgaaaacacgtgtcctatggagttttgggaaagggg 
gccgttgggatgagcagcagggatggaacagagcagtgcctgcctcatgctt 

CCCTGGAGACTCCTAAAAGCCATTGACAGTATCACACGTGAATACTCAAACTT 

gaggaagaaggcacccaccacactcaggccaggaccagagagctctataatc 
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CAGGAAAAGGATCTTTCCTGTGGCTCAGTCTCTAGAAGCCATAAAGGAAAGA 

TAAAGTGGACACAAGATAAAAGGGTGTGGCTGAGCAGCCACCACTGTGCTTC 

AAGACTTAGCAGGACAAGCTGCCCAGCTGACAGAGGCTGTATATAGATAACA 

CTGAGAAACACCTAAGACCCCCTGTGCAGAAGGTGAAGGGCAGCTAAATGTG 

GCCCTTGTTCCCTGTAAGGGTAGGGAACATCCCATCACAGAAAGCAAACACA 

AGTGCACCCAAGCAGAAAGACACTAGGCTGCACCATTCTGGGCTTTACCACC 

TGTTGCTATCTTGTCCCCATAGTCCCAGCCATGCTGGCCTCCACCCATCTTCCC 

AGAAGCATCATAGATCCATCAGCATTGCTTCTAAAGGCCACATCTGGCCCAG 

CAGAGAATGGTGATACATGGGGTGTCTACCCTCATGGCTCTTGATTGCCCTCA 

CAGAACAGGGTCTGCCTCAGTTGCTTCAGTGGATCCTAGCCCTGTTrCAGGGA 

CTCATATCrACCAGATCAACAGTACCTCAGCCCCTCAAAGCATTTGAGTTCAC 

acacaacagggcaccctgaagacacagttggctcttcacagagttcaGggtc 

ACATGAGACAGTCCTCATGCCCTTCACAGAGTGTGGAATCCTAAAAGACATC 

taagcctctcacagtacatgggtcccacaagacagtccccaagctcctcgga 

GTGTAGCTGGAACCTGGTATTTGGAAATTCCATTCCTCAAATTGCTCTATACT 

gggacagggcacacactcatgtctggcctgcagagagctatgtacaaaggta 
gttgtggtacccactaggactgccacgatggccttgccagaaatgatacatg 

AAAGTCCCCTGAGCTGAAAGGCCCACCCCAGTGACTGAGTTAGAATTGGGCT 

AGGACACCAGCAGGGCCCATGAGTCAGTCAGCAATGGCACACACAAGAACC 

ACCCCACCCCTCAGTGCCACTGTACCCATGTCCTCCrGTCCTCTCTCTGGGCCT 

AGGTCAGAAATTCCCCATACAAAGGGACCCTTrCTATTCTTCCTGCATCTTCA 

ACACACGGCCCAGGGGCCTGATAGGCATGGATGCAGGGGAATTGCCCTGTCT 

GATAGCCCAGTAGCAGGGTGGTAGAGACCAGGAGTGGGTGGTTCTACCCTTC 

CCAGCCCTCCTCCACCTCrTCCAGTTGCAGTGATGACATTTACTGATGGCCCC 

GGGTGCCTACATACCATCCAGAAGAGGGTCCCGGTGGCCAGAGCGGCATACT 

GCTCAATAGTGGACAGGACACTGAAGATGAGGCAGACCAGAACAATGAGGA 

AGCTGTGAGACAGGGGACATACGGTCAATGCCTGATCCAGCTTCCCTGCCAC 

CTCCCTCTCACCCAACACCCCACATGCAAGATCTTTAGTGCTGGCATTTCCAG 

GTATTCCTTAGCTAATGCACrCAAACATGG^ATCCATGGGATGGGAACCATTT 

ACCATGTTAGGACTGACCCCAGATTGGTGTCTCAATACACACTGACCTTGAAC 

ATGCCACACTGTGCCATCTCAAAAGCAGCCAATACTCAGGGACGGTACACAG 

CCTGGTGGCCATGGCCTTAGGGACAATGGGTCAGTATATCCAATTATTCAGTA 

TAACCCCCAGTATATCCAAGAACCTTGGTGTACAATTGAAGTTCAGAAGTACT 
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ATTTTTACAAAACAGGAGCCTTCATCCCCAGGAGTGCAGGGGGATAGAGTCC 
CTTAGCCAAGGGGGAGACACAAGAAAGGAAGGCCAAGCACAGCCTGAACCT 
GTGCCCTGAGGACACCCAGAAGTCATAGCTTTTACACTGAGGGCACAGTTGC 
TGCCTGCTCTGATTCCAAGGGCACTGGGAAGATAAGTCTCCCTTCCACTAAGC 
5 TCTCCATCCCTGGGTGTCATCATCCAAGGTGGCCCTCAAAGTTGCTTCCACTTT 
GGGGAGGGTGCAGATTATTAGAGAGGGGGATGCTTTGGGGAGCTTCAGAGAC 
TCTACGTACCCTCCTGGCATGTTCTTGGGGATCAGAATAGAAGTGGGGAGCTG 

^GeA^GTGGGAGGGGT-AGAAGAGGGGX^^CAGAGAAGGIATACA^AAA 

GTCATCAGGGCCCCAGCACAGACCCCTGTCCTCAACAAAGGAACCCACACAC 
10 TCCATCAATCCTATATCAGGTTTTCCCAGTATGTAAGGCCCAGACCACCAGCC 
TCAGGCTCAGCCTCCCAGAGGGCACACAGCAGCAGTAGCCTGACTCAGCCAC 
CATTTCCTGGGAGAGGCACAACTCAGATCTCCCACACCTCTTAAGATTGATTT 
CATCCATTCCTGGCCTCTGCAGCCACCGTTAGAGGCTCAGAAGACCAACAGG 
CAAAGCCCCCCCCATTGCCCCCTCTTACCTGCTGTGCCAGGTCAAAGCCTGAA 
15 GTCATAGCTTACTATGCCTCCCCCAAGACAGACACTCTTTGTCATGCCCATTT 
GCTCTGCGATGCACAGCACGTTCCAGCCTCCTCCTCTTGTTCCTCCTCACCCCG 
GCCCAGGTGCCAACTCTACCACCTCCAGATCCCCCACATCATGCCCAGTGTGG 
TACTCCAGTGGTACCCTGTCCCCCTCTTACTGGTATTGAGGACCTGATGTTACT 
GCCCTGTCTAAAGGCAGAGACTCAGGTTCCTACAGTGTGGTCCCATATCATCT 
20 GCAGAAGTTGCTCCAGTACCCACCCTGCCCCCATGACACCTTAAACACAACA 
CTTATGGCCAAACAGCTCTGCCAAGTCCTAGTGCCTACAGCCACGACCAGGC 
AGTGAGCATCCTCAGAACAGGTGACTAATATAGAGCTTGGTATACAGAGGTC 
TTGGTACCCCACCACATACATACCTCCATTTAATCCCAAACTCTTACCCCACT 
CACCACACCTATAGTAACCTCACATCACAGTAGGCAACGTCTCCATTTGGCAC 
25 TGATCCCTGGCCAACACCCACACCAGGATTCTCACATAGAGCACACATCTGAT 
TTGGCCCCAGCAGAGCCTCCAGGTGGGCTTCAGTCCTTGCCACTCCCCATCTC 
TACCACCATGAATTCCTCATAGGCCTAGAGCTCTGACAGCAGCCAGCCTCTAA 
CCCTGGCACCCAATGCACTACACAGTATTQTTTCCTCATGTCCACTCTCCACA 
TGGACTTAAGGACTCCCCCAGAACAGCACGACAGGACATGCAGGlpCTCCTT 
30 GGTCCCCCAGTAGTGTTCCTCCTGACCAGTGGTAACCATACTGACTGTTGGCC 
CACATCTATCTCCTGGCGCTGATCATATGGAAGCCCCTTGTAAGCTGAACAAG 
- CCACATCAGATCCTCAGGAAGCCATGGCTCTTATGTGGACACAGCAAACACT 
AGGTTAGGTTGGAGCCCCTCTGCTGTCCATCCACAGATTAGATGTCCATCCAT 
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GCCTGCTAGAGAAGTCTAACTTTAGCCATGTATCCTCAACTTCTTTCAGAGAG 
CGTTGTGGCCTTTTGAGACTCAGTTCCTCGCTGACTAGGCTCTGCTCTCTCCTC 
AGACTGACTCTTCACAGAGTGGGGCCTCACTAATCCAGGGTGTGTGTTGTTTT 
GATTTCCAGCAGGCAGGGAAGTAGGGTTGGGCTCATCTCTAGGCAATCGAGA 
5 GTAGATGCCCCACCCCAGACCATGGGTTACACATGGAGCCAGAGAGCGACTA 
GCATGAAGAAACGCAGCCTTTTCTGGCTGCCCCTCCACAGTGACATAGTCAG 
GCAGCCCCTACTTCAGATTCCCAATTGGAACATTAGCAGCTTAGGTCTACCCC 

^TCTTGGAAGTGTGGGTTGTGGGAGAGTGC^C)AGCCCTGAGXGTCT^ 

TAGAAGTGAGGGAGTGACTTACCCCAAGGTCTTTGAGAAGCCCCCAAGCTCC 
10 TGGCATCAGCACACATGCAAAGATATGGTGAAAACTTCCTGCTGCTTCCCTGG 
GTGCCCTAAAGCTTACACACACACACACACACACACACACACCCCAGGGCAC 
AATGTAGCCAGGTCCGGTCACCTGGCCACAGTTTGACAGAATAGCACTAGTC 
CCTCAAGAATCTGACAGGTCACAGTGTCACCGTGCTAAAACCCAGCTCCTATT 
TCAGAGGGGGACATTGAGGTTCCTGGAGCTCTTGAGGTCACTCAGGTGGAAT 
15 CACAGTGGTGCCAATGATAGCTCTTAATGGACTTTCAATTCTTTACTGCTAGG 
GTGACAACTTGGTCTTTCAAGCCCTCCAGAGCCTCGGCTCCAATATGGAATAT 
AATAGCACCAGCACCTGAGGACCAGAGAGCTTATGTGTAGCAAGGTAACAGA 
GATCTTCAAATAGGCCACAGGGAGAGGAATGGCTGTCATAGGCCACCCTGCC 
CATGTCTGCAGGGGGATCAGGGCCCCAGCAGGGCAGCAGCTGCAGCCTTCGG 
20 GTGGAAAATTCCAACTGCTTGGAAGGAAGGTATCAAAACAAGGAGGCTGTGT 
AGAAGCCCCCGCCTCTCCAGTAACTCCCACACGGCAGTAGGGAAGGCCATGG 
ATGCACAGACCAATCCTGGGGTAGAGGTACAGAGCAGATAAGTCCCCAGTGC 
AGACAGGAAACCACCTAGTTCTCCACAAGGCTGGCAGGGGCTAGGCAACACA 
ACCACCAAAGACAGCCCATGCTAACCTGCCCAGGAACCTTGAAGTTGGGGAC 
25 AATGGTGGCAATCTGCCCCCATGTGCTGGCTGCCTTTTAGCCTCGAGTATTCC 
GAAAGCTAAACAGTTCTTCAAATATTTGAACTCATTGGGCTGTCCGAGAAAG 

GGTTATATAGCATAGTTTATTTTTTATCTACT 

TATTTGCTACTTTATCTGTTCCTGCCCTGGATAGGGCTGAGAGCCTCAGAGAG 
GCCGAA^CAAGGGTCCCTGCCTTTGGGGAGCTCGTGGGCCAGAtGCCCAGGT 
30 CACCCACTCCTGCCTCTTTAGGACAGTTGCCATCCCTCCACCGTACCTCCATA 
ACCCAGAGCCGCAAGGTCAGGCTATCATAAGGTGCTCTCCATATCCCCCACCC 
CCAGCAGCAGCAGAGGAAACTGAGGCCCGGGGAAATGCCAGTGTCCCTGGTT 
ACAGACTACATGTTCCTAAGAGGGTTCACTGTGCAGGGTATGAAACAGAAGG 
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TTCTCGGGAACAATACCCTGCTTGTTCCGCTCTCACCAAATCAAACTCATCGG 
CCGTGACCGCAGCCCAGCCCTGAGGGACGTCCAAATGCCTTATCGGACATCA 
CTTCAATATCTCTGTGCCbAGGTGCCCTGGGAACCCAGAAGAAGATGGCATC 
CAGCCAACCCCGTATGGCTGCCCAGCAGAAGGTATGGGAGCACAAGGAACA 
. 5 GTGTCGGGAGTACGGACAGGCCAGCTATGGCAAACTTGGAGATCTCTTCCAT 
ACATGTTAGGCCOTGACCAGTAACTGATCirTAAAAAGCCAGAGCTGGTTCT 
AGACCCATGGGCAACCAGACCTGCTCAGGCACAGGCCTGATGGTGATACTGT 

GGGTCTCCCCAGATAAAGGGGTCAGCCTAGCCACCATCAAATTTCCAGCACT 

10 CAAAACATGGCAGTGAAACATCTCAAGAGGTCTCAAACCATTGTATCTGCAT 
ATACAATCATAGTGTGAAGTTCCCTAAAACTGTTGTTAGGCTGGGGCTGGAG 
GTACTGTGCACAAAAGGGTAGAACTTGAGGATACTTGGGAAGATAAGCCAGG 
GATGAGATGACACATATTTCAGGATCAGAGACCCCTGGGGTTACCAGGTCTG 
CAGAGACAGAAGGCAGGATGGGGTGCCAGGGATTGGGAACAGTTTCAGTCTA 

1 5 AAATGAC AGAAAGTTCTGGAGGTGGAGGTGAGAATGGCTAGCTACACATTCT 
GTGGGTGAGCTTCACACTACTGGCATAAGCACACAGACACAGCCAGGCCTTG 
TGTCTCACCACAACTAAAATAGATTTAAGAAAGAAAACCAGGGGCTGGGGGC 
ATGACTCGATGGCCAAAAGTGCTTGCTTCTCTCCCAGATGGTCAAAGGTTCAG 
TTCCCAATGCCTCTGTTGGGTGGCCTCCAGCTCCAGAGGCTCTGATGCCCTTG 

20 TTTGGCATCAACCAGTTTGGTTCACTCATACATGCATGCACATGTGTACAAAC 
AAATTAAAAAAGAAAGAAAACCAAGCAAATGGATTTGATAATAGAATTGAG 
TTGGCTAAGTACACTGGCCATGGGCTATAATTGAAGCTATCAACATCCAAAG 
GCCCCTTGGGGGCAGCTCTCCACAGGCCAAGAAACAGTCAACTAAAAAAATC 
TAAGGTACCATGAGCCTTTGTATGCACGACACCCATGGACTAAATCCTTATGT 

25 GTGAGCCCTGAGCCATGTGTTTGCTATTGTATGTGTAACGCTGACTTTGCTATT 
GTTCCACAGAGAGTGACTTGGAAAGATTCCACACTAACCCTCACATACGCTA 
CCTGGAAAGGGAAGCCTAGAGGGGGACCCTACTAACTCACTGCCATCACCCA 
CCCTCATGGCCACCTGGACTCCCAGCTGCCTGGCTTGAGGTTAGATATAGACA 
TGTCCCAGGATACTGGACTTruAGAGGGGACAATAGGCTTAGGGATATCCCA 

30 GGAAGGATGGTACCAATAGCAACTGTCAGATGAATAGAGTTGGCAGGGTTAG 
AAGACTTCAAGGTAGCCACTTGCTCACAACACAACCTCTCCAATGTGCTGCAC 
AAGGTAGAGCCTAGGCCAGGGTTACCTCCTAAAGGGACACACCTCAGTGCAT 
GCTGAAAACCCAAGGTGTGCCAGAGACTGATCAGTTTCCCTTAACTAAGCCTT 
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GGGTGGAAGAAGTCCAGGGGAGACACAGGGGAGTGATGGTTGGTCTCAGGTT 
CAATCCACTTATGCATATTTAGAAGTTCCTGGATTAGGGACAACAAGATATAG 
CCACTTGTGGGGCGAATAAGGTCCAACATTGTATAGAGCACACACAGGGCCA 
CGTAAACAGTGAGCTCAGGCTGGCTATACAGACCACAATGCTCAGAATAGGA 
5 GAGCCACCTCATTCTACATGCTATTCTAGGCTACCTCACCTCCTCTATCTGAAT 
GGCCACCTCCTGTTCACTCAGATCTCATTGGGCACCTTCTGAAGGGACAGTTG 
ACCTTGGGTTGGGGTATACCAGATAGCCCAGGTTTTCACATCCTTAGTGGCAA 
GAGCTTTGTCTACATGAGGCCACACCACAGGCTTCAAGAATATCTTTGAGGCC 
TTTGAGTTCATCATGTAACCTACGTCAAACCACCCATCAACAGTGATTCTCAG 
10 AGGATCACGAATTCTTAAACATCAAGATCAGTTGGTCCAAAGCACATACTGC 
TGTCACAAGCTTACGGGTGCTCAAGCITTTGACTCCATGGCAGCATATTGTCA 
TCTATAAAGTCCACACAGAACTGTGGGACTAACAAGCAAAAAGAAATCTTAA 
AATCTTAAGTAAGTTTATGATTTGTGGGAGGGCCACATTCATAGCTGTGTTGG 
GCCACACAAAGCCTGCAAGTTGTGGGTGGGAGGTGCTACATAAGTTAGGGGA 
.15 GACCCAAACCAGTTTTGCCTCCCACACACCTTAGGCCTATACAAGGTCAGCCC 
TTGCCTGCCTCCCTGAGGGGCCTGGCCTCAGGCTGGTGCCTTTGCCTATGGCT 
AGAGCTTCTAGGCCTGAGATTCTCCTGGGCCAGCCACTGCCTACTCCCTGAGA 
CTACACTGGCTCTAAACCCTTTTAGGCCTGGCTCTGGCCATAGTTCATTTCATT 
AAAATGTTCTGGTAGGAAACACCAGCAGTGGTGACTTTATCACCGAGGAAGT 
20 GAGCGTACTAATCCATTGTCTTGGGAGCCCTTAAATGTGCTGATCTGGAAATG 
CAAACCTATCAGGAGATTAGCCTCAGCCCTGCACTGAGAGGCCTGCCTCACCT 
CATAGCACCAGCTGACACTGTTCCCATGATCTGGCCTAGAAACCCCAGAAAA 
GCAGGACATTCACCTGGCTCTGCAGAGCCCCCCAACCTCTGTGCCTATGCACC 
CCC€CCAAGGCTACAGGGCAGAAACCCAGTGCATCTTGTGCCCTTATTCTAGG 
25 GCAGCATCCTCAGTCATCACAGAGTCCTAAGAAATGAGTGGCCCTACCTTCCC 
AGGTATCTATGTCTGGTGTCCTCCAGCCCACTGATTGCTTGAAAACCTCTGTT 
CTAGCAGAGAAGCTCTTGGCCCAGTAGGGGGACCCATTGGCTGGCACACATC 
TGAAAGCCAAGCAGTAGGCCAAAAACACTGTGTGTGTGTGTGTGTGTGTGTG 

tgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtctag,;atttcctgaccgagtg 
30 agcaattgttaagctgagaggaagtctctggtcctcttctacggaacccattt 
gtgcacctgctctgaatctctagctcaggcaagctcttcacagcctctgaact 
gggtcagtgcaggatacaggacaaaacccagacctgatgctggccaagtcat 
tagcactggtcatgaggagatgaatgtccttggtggctgtctccagattaaca 
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GGTTTAAGGGTGGCAGTACATCCTATGTCATACACATGACCAAGCCAGGTCCT 
TCTAAAGGCTGCAGAGGTTGAGGCCACCAGGTCACTGAGTCAATGGAACTCC 
ATAGCCACTCTCCATTCAGCTAGACTGCTAGTAACAATTAGAACACATGATTT 
CCATTTTGGAGTACCTTCCAAGAACTCAGTGTCCTTGTCCCTTAACACTCAGA 
5 GCAGCACCTAGCATTAGACATCCCATCCTAGTAGCACCACCATGTGTGACCCA 
GAGAAGCTTGACCCCTCCCAAGCCACTTGTCAGGCAAGGCAACATCCCCAAG 
CCTGGGTCTCACCTTAATCACACATGTCCACAGCTTGGCTCGTAATTTATCCC 
AGCTCTCAGAATCCAAACTTTACTTCAGCAGAGTCCCCAAGAATAGGCAATG 
GTGTCACTGTTACCCCCTCTCAGGGAGCGTCTTAGTCTTCCTCTCCTAGGTTAG 

1 0 CATTC AG AT AGGTGAAGACC ATCCTTGGCAAGGGGTTC AAGCCATCTGC ACT 
ATCATTTGAAACCTTTTTrGGAGGCTGGGGAGAGCAGCAAGGTACTTGGTCCA 
TAGCATGATGACCCGAGTCTGACCCTCAAAACCTTTGTTTGAAAAAACAAGG 
CATGGTGATGTGTGGTTATAATTCCAGCTCTGGAGAGACAGAGAGGCAAGCA 
GATTCCTGGGACTCGGTAGCCAGCTGGTTTAGCCTAATTAGCAAGCCTTGGGC 

1 5 CAAC AAAAAGACCCC ATCTC AAAAACCAAGACTG AC AACCCTTGAAGAGGG 
ATTCTTTTGGTTTACCTCTGGTGCACCACACACACACACACACACACACACAC 
ACACACACACACACACACACACACACCTTCACACCTTTTTGGAACACTAGAG 
GGATAGGACAGATTTGGGGGGAAGATATGCTCAGATAATGTGCTCAGAGCAG 
GGAGGCAGGAACATTAGCCAGAGTCAGAGGGAAGAGCTGGGAGCACAGCTG 

20 CGCTTCGTGGCTAGTGGTTGTATCTGCAGCTCGGAGAGCCTGTGGTGGTCAGT 
GTGTTGACCTCTATCCCAGAGCAACTCAGGTTGAGAGTTTGTGTCTGAAGATC 
TGCAGCCTAATAGGATCTGCCTGTAGGGCCAGACATACATGACTCCCACAAG 
GAACAGAAACTGCAGCACCTGCCCTGGAGCAGCTGGCACCTGCCCTGGAACT 
GCTAGTTTTCAGACATGTAAGAGCCCCTGGGGCTACCCCAGCTGGATGGCAC 

25 CACATAGGGCCAGACTCAAGAATTCAGTTAGCTAGCGCTCTCTTATTCCTGCT 
TCTCTCCCTTGCCTCITGCCCTCnTerCTCCCCCTCCATCCCCCArcCCACTTCA 
CGTGGTCATGGCCAGCCTCTACTTCTCTACTCTCTCCCTCTCTCTCTGCCTTTCT 
CTGCCTCTGCTACTCTCTTAACTCCCCTCCCCATGCCCTAAATAAACTCTATTC 
* TATACTAAAAAAAAGAAAAAAAGAATTCAGTTAGCAAACAGATCTGAGTCf i : r 

30 CTTACGAGGTCACAATGGCACCCCCCCCACCCCCAGAATATGAACCAGAAGC 
CTCAGGTATGGGAAAAAGGAAATGAACCAGCTAGGAACCAATGTCTTCCTGT 
CAATTrAGGAGGTACGTAGAGATGGCCTGTCTTTTATTTACAGGATTGAAAAG 
AGAAGCTCAGAGAGGGTTTATGCCCTTCCCAAAGTCCCACAGCAATCAGTAG 
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TTAGTGAGATCAGGGCACACAGGATAGATCTCCTCCCAAATATCTCAAAACC 
TCTAGGAAAGCCATACCCCAGAAGGGGAGCCAGGCATCTCCCAAACACCTGG 
CCCTTCAAGGAAAGGCTCTGGAAGAAAGGTAGTTAGTAGGCCTGGTGGAAGG 
CCCGTGATCTCTCCCACACCCCAGGGCTCAGAGCTGTCACTGCTGGGTGGCTG 
5 AGTACTCCAAAGACAACCAACAGCCTATGAGTAAGGAAGCATGCCATTCTTC 
TAGGGACCTGAGGAGGCCAGGGCCAGTCAGTACTCTAGAGTGTTGTGTCCCT 
TGGCCGTCGGCACACCCAGCCTTGCTCCCCCAGGCTCCATGATAAGATGCCAT 

AGTG ACT AT ATTT AQ A A TfiTOTGG TGGGGAC AGGCTTGTATGAC AC ATGCT TG 

GGACCTGTCAGTGAAGGCATTCATGGTTCTCCAGCGTGATGTGGGGGCCACA 
10 TGCTCCCCATACATATTCCCCACTTGGGACATGGCCTCTGTATCTTGGAGATG 
TCTCTACCCAGGACAGTGTTGGCCCAGCAGCCTCATCTGGCTCAGAAAGTGG 
GTATTGACTACTTAGGCAACTCACTGGGCTGATACATCCCTGCAGCCAGCCAT 
ATACCTGTATATGTCTACGTTCTCACAGTTCAGTCCTATTGGGAAATTCCCTCG 
TTACATCAAGCTCAAGTCTCTCTTGCTGTCTTCTGGGCCCCATTCCAGAGGTTT 
15 TAAGGAATCTCTGTCCACCAAGCCATTGCACAACCTCTGAGCCTCCGTTTCCC 
CCACGGAGGAACTGGAACACATCAGACTGTTCAGCAGCTGTCTGTGTTCAGT 
AAAGGCGGGACAGTTCCACGGTGCCCAATCCCAACCATATCATTGATATTCTA. 
ATAATGGTATCAACACCCTGGGATCTGACACAACAGAGGTTTCAAGGGCACA 
TGGCCAGCATCCCTGGCAGAAACCTGCTCAGGTTCCACTGTOTCTTTGGTAC 
20 AAACCTTCAATAAGGTGACTCCAACAGTGTGGCCATAACAAAGGGGAGCTGG 
C1TTTGGCTATGAGGGTGACCCAAGTTCAGAGAATTGACAAGGTAGCCAGGG 

gtaacctaccacccgctccaggaataggcacccacagttgtctctgtagggtt; 

TGGGCACTGCCATGCTCTGATCAGCTGTATGGTGACTGAGATAATGTGGGGTA 
ACCCTAATATCCCCTCCACATGGGTGGATCATAGGGCTGGAACAGGCAATGC 

25 TTCC AGTTAGGC AGGTTC AGTCCCCATCTCTCACACTCTCTTGTAAGGC ATGG 
TGGTGGCAGAAGGCTTCTAAGATGTGGAGACTGATGAAGAGGACATGGGGTC 
TTTTAGGACAGGGTGGGTGAGTGGATCTCCTGACCTGTGCCCTGACCTCCTTC 
CAC<:TGAATGGAATTCCTAAGTGACATTCAGCCATGCTGACAGCAGAGGGCA 
CCAGTGAGAGCA.:\ATAATGACACCAAACAGTGACAGCCTGCAAATAGAGAC 

30 AGCCACAAAGACCCAAATCACCCAGGGAGCAATTGAGCTACAGATTATCGGG 
GTCCCCAAAGCTACTTTGTATAGATGTCCCCTTAGGGTAGGGGTACAACACAG 
TAGTAGTAGTGGCCCCACAGAAGCCCTTGAGCAGCAGACAGTATACTTAGTG 
TGACTATGCAGAGCCCAGGGTCTAGTCTTTGCCCTACCTAGAGAAACTGAACT 
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CTGAGAAGGATAGGATCTACCCTACATGTTGTCATGACAACAAAGACATGGT 

CTCTGATTCTTGACTCAAAACCTCAGCCTGGGTCACTAGGGTAGAGCTGTACT 

GTTGGAGATGGATAGGCTGGCTTCCTGACAAGAAGGAGCCTCAGGTTCAGGG 

AGAAGAGCTGCCAGTCCAGAGTATGAAATGACCACAGTCACATCTCAGCAGC 

TTCTGCCCTTCCTGGTAAGGGCCTCTGTGAGGATCAAGTACATGTGCTGGGCA 

CAGGTGGCTTACTTCCCCTGGCATCATTCATTGAGAGACCTACATGTCACCTG 

TCCACTTGACAGACCCTGACCTTGATGTTGTGTGAGTGATCTTGGTGCCCTGG 

TCAGAAGTGACCTGGAGTCACATACTGGAACAAAGAAGGAATATCCTGGGTC 

CCTGCAGCCATGGTTCATGGCTCAGGAGATCTGATGGCCCAAAACCCACTCA 

ACTCTGTGGTTCTCTTCTTGAGGACCCCTCCCTCTCACTAAGAGCCAGTCAGG 

CTGGAGACCCCAGAGCAGCCACAGGTCTCTTCCTGAATCAGGAACCTCAGCC 

TCCTGAGATCGTGCCTCAGTGCCAGGCACCCTCCGGAGTTCTACTGCTGTTGC 

CCAAGTGACTTGGACCAAGAAAAGCCTCTGATGACAAAACAAGATGGTCACA 

TTCACTTCAGTGCAGCCTCAGCAACACACAGCCTCAGGCAGTGCCCATCAATG 

GGACTGTTCTTCTGGGTCACAGGCAAGTCTCGATCTCTTGACAGATGTCTGCA 

CCACCGACCATAAGACAGAGGACCTCAGGATCATGTGGGACAGTGTCCTTGT 

CAGTTTTACGGTGTGACCACGCACATGGATGGGGCTTCCTTAGAAGAGTACA 

AGAGAGTACAGTAAGCAAGGCACAGGATGAGGGACATGCATATATGGCTAC 

AGGGATGGGGGGAGGGAGGGAGAGAGAGAGAGAGAGGTGTGTGAAGGGCA 

AGGATGGGGAGGGACAGGATGTAGAGGGTCACACATGTCAAAGGAGATGTA 

CCAGACACACAGTTTCCCTGACCACCCATCGCAGAAACACTACCCATGTGCC 

ACTATCCTTCAAGTCCAACCCCAAGCCACACTCTGCAGAGCCACCGATACCTT 

AGGGAGCCAACCAGGCCACACCCACAGCTCTACTGCCCCTCCACTCACAGGC 

CAGACCCAGTGAAATCTATTAGTCACTAGGGATTGCTGGGCTCCCtAGCTATC 

AGCCACTGTCCTGTGCCAGACCCCCACCCCGTGGCTGCTGCCTCATCCCAGGC 

CTTCTGCTCTCACCAGCCACCGACCCCAGCCATAGCTCATCCACTAGGGCCCT 

GGCAGGGTTGCTTCTGGGGGACCAAGAACTCACTGGACTGAGGAAGTAAGGT 

CCTTCCCACCCTTAATCTCAGCTTACTCTCAAGGGCTGAACATGGCAGCTGGT 

ATGGTGGCACTAAAGCAGACAGAAG'CAACCTGGCAGAGCCCTCAGGAGGAG 

GAATAGGAGCAGCAATGGGCATGCCTAGGCCACTTGCTAGAAGTCAGTGGCC 

AAATGCCTAGGCTCCAAATGACACCCTCAGACACAAACTCCTAGGGACTAGA 

GGAAGGCCAAGAGTAGCTGAAGGCCCTTTTGTCACCCCAGCAGGCTCAAGAT 

ATATGCAGTACTTGGGGTCACTGCCTTGGGGCAGAACCATAAACTTCAAAGG 
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CCCTGAAGCCCACATGCCTATGTATGGAAAGGCTGATAGCACTGAGTGTCAT 
GGCCCTGGAAGGACCCTGGCCCAGCTGGCACATGACAGACACTGAACCCTGG 
GTTTTCTGTCATCACAGTGGGGCAAGACCCAGAGCGGTCTCAGGAAGCCATC 
CCACTTCCCCCGCCTCTGCCTGGCCAGGACACACTTCATTTCCTGAGCCTCTTC 
5 CTCGGCTACCACACTCAGCTCTGACCTGTGCCCTCCCCTGGGAAAGCCAAGTA 
TCCTTTGGGAGCTCTTTCCTGGCGTTGAGCTGAGGCTGAGGCCACGCTACACA 
TCAGTGCCTCCTCCCrGACTCTGAGGCTCTGGGGACTTCCAAAAGCTTGCCAG 
CCCTACCCTCTATAGGGCCACTACCCCTAGCCACCAAGCCTTCAAAGTGAGGA 
ATGTACTAGGTGTGGAGAGAATGACTGTTCTCCCAGGGCAGGAGTGAGGCAG 
10 TGACAATAATAATTACCATGGGCAGAGTTCCTTCAGGCTGAGTGTCCTCATCC 
TACATGGGGGCTCTGGCCACAGGGCCCCATAGCCAGTGGACTAAATGATACT 
CACCCAGTACTCAGACTTCTGTGGAATTCCCTGGACCTAACTAAACACTGCCC 
TGCACACAGAGGACAGCTGCACCTGGCTAGGCTACACTAGCTCTGGGAAGCT 
TTrGGAGACAGGTGGCAATTGGTATCATGTTTTCACTTCACAGTGCCTTCTGG 
1 5 T AATGCTGCGGCTTT AGAAGG AAAGGC ATTAGCCTAAAGCTACTTGCC AGAC 
CTGGCTCCACCCCTTTTCAAGGGCCCATAAGGAACTCAGCCTTTGGTACACCT 
CGGCTGTCATCCTAAATGGCTCCACCCAGAAAGTACCAACTGTGTGTGTACCA 
TGCAGGCTCTACAGATGCAACAACCGACAACATAGACACCGAATACTGTGGG 
GGAAGGGGTGTACTAACAAATAACACAATGGGAAAAACACAAGATGAGGTC 
20 ACAGTGACTAGTGGCTCAGGAAGGGCTAGGTCACCACGAATGATAGGCCAGA 
GACAACAAGGGAGGTCATGTGAGTGGTGGGCCAGGAAGCCTCATGGGGCAG 
CAATGTTGAAGCTGTGTCATTAGAGCTGGAGACAGGTGATCCAGGTAGAATT 
AACTGCATGTGCCAAGGCCCTGGACATCTAGAGACCAGAGTAGCAACAAGTG 
TGATGGGTGAGGACACCCTTTCATAACAGAGTCAGAAGGAAGGAGATTCTCA 
25 CAGGCTAATAAAATACAGAGAAGTTTGGTGGGCACTGCAGATGGCTGCTGTC 
CAATGAAGTACCATCAGCCACTATGTGTGCGGGGTGTTGGGATACATTGGAG 
AGCATGTTGCAGTGGCTGTCmTAGAACCCCCAAGGCCTGCCCTATGCCACA 
TTCCTCAGACCCCTCCAGCACCCCTGCCrACACCTGTTACCTTGGGCTCAGCC 
AATGGCCACCAAGCTCCTTrCCTTCAGGGAGCCCCCrCATXirTT^ 
30 GTCTCTTGAATACCCAGAGCCAGAAGTTGACAGAACAAGAATGGGGCATGCG 
CTGAAGCAAAGGAAAGGGTCCAGACATGAAGATGAGGCAGGAATGCCAGCA 
TGTCCACCACATGTGGGAGCTCTCCATAATACCCCCTTGTATATAAGTGGGAA 
CTGCTATTTTGCTGCCTTTGGTCTGGGATAGCCITCTGGACAAGACCTGGCm 
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CCTTGGGCTGAGGATTGAGTCTCCTCTCCAGACCACTGATCCCCAGGCTCAAG 
GACCCAGCCTTCCTGTTGCAGACCAGTTTTGGCCAGAGGACAGTGTTTGCCAG 
GCAGCAGAGACTCTGACCCTGACCATGGAACAGAGGAAGGCATACAACTGG 
GATAGGAACCTTGGGCTCTCATCAACCCCATTCTCCAAGCCAAGTTAATGAAC 

5 ATATGCTGTTAAAATAAAAACCTTTGAGGTTAAGACCATGGCTCAGGATATA 
AAGCACTTGCTACACAAGGATGAGGACCTAAGTTCAAACCCTGAGGACCCAT 
GTAAAAATTCAGGTATGGCTATGCATGCCTATAACCCTAGAACTGGGGAGGC 
ATAGACAGGTGGGATCCTGGAGCCCCCTGGTCCGCTGATCAAGATGAATGGA 
CGAGCTTTCAGTTCTTAAGACATCCTGTCTCAAAAAAATAAGGTGTGGAGTGA 

10 TAGAGGAACACACTGATTTCAACCTCTGGTCTCCACGCACACACATGTGTACA \ 
GCCCTATGTACACGCACATACGCTGCAAACAAAATAAAAATAAAAACCTTTG 
GTGGTTTCTCTGTCCTTCAAGCCATGCTTCCTATAGTACAG.TGGTATCTGCTCA 
TTGGGAACCTCCAAGTCCCCATTTAGAAGCTTCCCAGACCATTGCTCAAACCT 
GTTTGGGGACATGACTCACAGTCATGATGCGTGTGTCTTCCTCCATGACCCTC 

15 TCTCCCCACTGCAGGTATGTCTCCTTGGTTGATGACTTTAAGTGTTTACTGGAC 
ATATATTGGGGTAGAAACTGAACAAGTAACCATCCTCAGCCCTTCACAGATCT 
CGTGGGGAACTAGGAATATGATGGTCTTTCTCCTTAAAAACACCCTGCCCACC 
TGTACAATAGTCTCTATACTGGAAGCCAGGAGGAGGGTCATCCTCTAGGGCC 
TGTAAGGCTATCTATATGAGACTCGAGTCACAGTCTGTGGAGTCTGCTTACTA 

20 ACCTCCCCTGCAATCTTCATTCTCTGTCTAACATAAAGGATGCtTATTCCAGG 
ACCCATTGCTGTTCCTGTGTGGAACACTCAATTCCAATCTGCCGCAGATCCCC 
AGTGTAGTCTTCAAGGATATCCTGGAGAGCCCAGGCTCCCAGATGTGTGCCTG 
TAAGGCTATATTTTGTAACAAGGTTGCCACTGTACTGATACATACCTACACAG 
TTGTGGTTTGTAACCCTGTCCAAATACTCCAGATAATGAGATTATACTTCGCA 

ACTGTAGCTGTACAGATGGTTGTGAGCCTTCATGTGGTTGTTGGGAATTGAAT 
TTTGGGACCTCTGCTTGCTCCGATCGACCCCGTTCACTCTGGTTAACTCTGCTC 
AGTCCCTGCTTGCTCTGTCCCAAAGATTTATTTCTTATTGTACATAAGTACACT 
GTi.GCrGACTTTAGACGCACCAGAAGAGGGCATCAGATCTCATTAGGGTGGT 
30 TGTGAGCCACCATGTGGTTGCTGGGATTTGAAGTCAGGACCTTTGGAAGAGC 
AGTCAGTGCTCTTACCGGCTGAGCATCTCGCCAGCCCGCAACTTTATCTTCTA 
AGGCAGCTACTAGAAAAATCTAATTGCCCACACAGAGGTGAATATGCATGCC 
CTACACCACTAGGCGACCTGAAAATAATATGGTAGATCTCAACTAACCAATG 
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CAAGCCATCTACTAACCAAGAAGAGCAGAAAGCAAAGCCTATGTATGGCTCC 
TTGACAGGTGGTGCCGGACAAACTGGGACTCTGCATGCAAAAGAAGAACACC 
AGACTCTTACACTTCATACCCATAACCAAAGGCCTGCATGTAAAGTCTAAAAC 
TACCAAACCCTTAGAAGAAAGCCTGGGACAAATGCTTCATGATATTAAATTT 

5 AGCAATGATTTATTTTATATGACACCAAAGGCCTGGGCCACTACAGGGAAAA 
CAGATTCATTACACTTCATGAAAATGTAAATGTTGTGTGGCAAAAGGCAATGT 
CGACAAAATGGGAGATTTTTCGGATCTTGTATCTAATAAGAGCTTAGTATCCG 
GAATAAACAAAGAGTAACTAAGGCAGGCGAAATAGAATTCTGAGCACCTTCA 
ACATTAGTGTCTCACTTCCTTTACTCACATCCTTAAATCAGCAAGCCCTGAAC 

10 CCACTCAGGATGGGGCACCAGGATGAATCACCAGCCTCATGCCAATGCAGCA 
CCCAATAGGGTCACACCAAGTGTGTGTGGGGGGGGGAGGGGACAGGAAGCC 
CAACCTGAGCAGAGCCACTGTAGCCAGCAGGGATCCAGGGTACACTGAGGCT 
AATGGCAGAAGGCTGTACACACACAGACATTGGGTGCCCACCCTCCTGGATG 
CTCAAAGAGAAGTGGCAACCTGGAACTATCACGTGTGTGTGTGTGTTGGGAC 

15 CCAACAAGCAAGTCTTCATGCCAGGAGCCATTCACACAGTGGGAAGCAGCTG 
CTACAGTTTGTCTTGACAGCCTGGAATTGAGCTAGCAGTACGAGGGCATACAT 
GTATTCATGCATCCAGGAAAGGGACACAGAGCAGAGAAGGCAGGCACACTG 
CAGTGTTCCCACCAATAACACAAGGCAGGGGTAGGGGGTTGCAAGCTCGGCT 
TTAGGGGACTGGATTCCAATACGAGGAACCTAGCTCTACTGGGAGAAGAGCT 

20 GGATTTTGGACTGGGACAGGGAAAATCCCAGAGGAGTCTGCAGCCTTAGGAA 
TAAGAGACAGGATCCAAGTTAAAACAAAACAAACAAAAACAGAAATAATAG 
AAACAAACAGAAACCCTAAAACAGGGCCACCATGTGACTTAAAGGGTTACCT 
AAGAGACCTGGATGCCAAAGCCACAGTATTTGAACAACAGAAATAAAGAGT 
ACAGAGCTGGGCATGGTGGTACACCCCGTCATCCCAGGACTCTGAGATGGAG 

25 GAGGGAGAGGAGGGCAACACAAATTATTCTGAGGCCTACTTGGGATACCTGA 
GACACTCTCTTATGTCTCAAAACAAAACAAGACCCACAGGGCTGGAGAGATG 
AATTCATGAGCAGGAGTGCTTGCTGTGGAAACATAAGGACCTGACTTCAGAT 
CCCAGTGCCTACCCAAAACCCACATGAGTGTGACACCCAGTGCTATGGGGGA 
ATAAAGGGAGCAGGAG^jATCAGAGGACCTTGCTACCCACCAGCCTAGCTCCA 

30 GGGTAGGTGAGAGATTCTGTCTTAAAGGAATAAGCTGGAGAGTGATAGAGGA 
CGCACTCACCCCATGCCCTCTTTGGGACCTTACACGTGTATACATATACCACA 
CATAATTATAACACACACACACACACACACACACACACACACACACACAGAC 
GCAGCACACCACAACCAAATAACAAAACCCAAATTGAAACAAAACAAATGA 
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ACAAACATTAAATAGCAGTTGAAGTGTTTGCCATAAAGTAAATATCCTTAGGT 

TGCACTGATCTGAATCTATCCAGGGAAAGTAGCTGGAGGGCTTTTAGGGAGA 

CCCTCAATTCTGAGACACAGGAGGAAACTGGGAAACAGATGCCCTGCTCAGC 

AAACATTCCAGAAAGAACATCACAGATAAGGCCCACACACGGGACAGGAGA 

AATAGCTCAGTAGTTAAAGCACAACTGTCTTGTAGACCTGGATTCAATACCGA 

ACACTCACACGGCAGCTCACAACTGTAACTCCAGTTCTAGGAGGTCCAATGTC 

CTTTTATAGTTTCTATAGGCACCAGACACACAGGTGGTACATAGACATGCATG 

CAGGCAAAACATCCATACACATAAAACAAATCTAAAATATAGGAGATCTACA 

GAAGTCCTGTGTCAGTCTGACTTGTATGTCTGAAGGTGTCTGCCCCAGGATGG 

TTTCTGAGGATAAATAGTTGGCACAACAGCCCCCACAGGGTAGCCAAAGTCA 

ATGTGACACATGAGGGGACAGCACAGTCCAGCAGCCCTGCTCAGCTATACTC 

CTGGGACCCAAACTGCAGCTCCTGGGCTGACCTTCCTGCAAGTTCACAATTTC 

TATCTGATCACAGAGTCCATCAGATTGGAGGCATGGGACGCCAACAGCTGGC 

ATTCCTCAAAATGCCAAGGTCATGAGAGACAAAGATGGAGCTTTCATGAGTG 

GTAGGAAACAAGGCAGATGTAAAACCTGTCTGTGGTAGAGATCGCAAAAGA 

GAAAAATATATTTTTGGTGGGATGGCATAAGTCACTTACTGGTGACAGTCTCC 

TGGCTTTAGTCACATGGCCAAATGATAGAATTTGCTGAGACTATGGGAGCAG 

CTGGAGAACTCTACACTACCCTGGAACATTCTCAAGCCAAGATTTATCTTGAA 

ATAAAAAGCAAGAAACAGCTGGGCGGTGGTGGCACACGCCTTTATCCCAGCA 

CTTGGGAGGCAGAGGCAGGTGGATTTCTGAGTTTGAGGCTAGCCTGGTCTAC 

AGAGTGAGTTCCAGGACAGCCAGAGCTATACAGAGAAATCCTGTCTCAAAAA 

ACCAAAACAAACAACAACAACAAAAAACCAACCAACCAACCAACCAAACAA 

ACAAACACAAAAACAAAAAAAAACAAAAACAAGAAACCTGTACGGATGCTG 

AGAAACCAAATCTTAGCACCTGTTACTTGACCCTTCACTTATTTCCTGGTGGT 

CTGAGCCACAGCCAGTGGCCCATCTCTACTGCCCACCTTTCTCTCCTAACCCT 

ACCTCCCTCAGCCCATCTACTCCAGCCCTGTCCCCCTGCTGCTCCCACAGGTG 

TCAAGCTTACATCCACAGGGCCTTTGCACACCTGATGAGCTTTTCCAGAAAAA 

GAATGGCTCACCTCCTCTATTGACCCCGGTTGAGGTGGTGGCTCCCCTGGCTA 

CCTTCCAGAAGGATGCTTGGTTTCCTCCCACTfirGCCCTGTATGTGGTTTCTTG 

GGGGTCTTGCCTCTTCCTGCATATCTGTTTGCATCTGTGTCTACTCCTTGGAGT 

ATCACTTTCTTGTCTCATTACCTCAGCTTTCCCACAATTCCAAGGGCCCAATGG 

GAGGGCAGTGGGATGGTAATGAATAGGACATGAAACAGAAGCATATCGACA 

ATGTGCCTTCACTGAATGTTTGTTGAGAAAAAAACTTGCACTATGTAGCCTGG 
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CTGGCCTGGAACTTGCTATGTAGACCAGTCTGGCCTCAGACTCACAAAGAACT 
TGCCTCTGCCACTATATTCAGTGAGATTCTATATTTTAAAATTGTGCACATGTG 
TGCATACTCAGGTGTGTGGGGTGTACTGCACAAGTGTGCATGGAGCCAGAGG 
ACAAGCTGGAGATTCATCCTCAGGAATGCTGCACACAGACTTTGAGACAGGG 
5 TCTCTCCTGGGCCCAGAGCTCATCTATTGTTGAGGCTGATTGACCATTGAGCA 
CCAAACATCCTCCCCTGCCTCTGCATCCCCACTACATCCAGCATTATATGTAA 
ATTGAGGATCAAACTCTGGTCCTCAGAAGCACTTTGCTGACTGAGTCTTTCCT 
TCCAAACCCTGAGGTTCTAATTTTGTCTTCATCAAATTAATGAAGTGTTAGGT 
GGTGATCCCCAGTTTTAGAGGTAGACGTCAATGATGTCTCACAAGGTGGGTG 
10 GAGATAGGTCAGGGCATTTCCTGCAGGCCAGTGTGACTTTGGGTTCCAGAGTC 
ATAAAGCCGTGGGAAGTCTTTGATCCTATCAGTCTCCTGCAGATGATGCCCAA 
GGACGCAATCCTGAGTTCCTGTTGAACTGTGAGGATTTGCATCAGAGTGCTAT 
TnTAATAGTCAGTGAGGAAGTGGTCCAGGTGTCCACAGGAGAGAGTCAAAC 
AAGCTACATCAAGACCACAGTGGAAAGGCAAGGACCATGCAGGAATGTAAA 
15 CAAAACTTTAAAAAGCAGGTCAAAGCAAAGGCCCAGCATTGAGATTTCCTCA 
AATGTCTGCATGAATGTTCAGACACAGATTGCTAGTGTGCCTAGTGCTGATGA 
GTGTCTCCTGCTGCTCTGATGACCTGCTGTCCCTCAGTTCACTTGCCTCACCCT 
CAAACACAGGTCATGGACTCCCACAGTTCCTCAGGAGAAACTGAGACACAAC 
CAGGAAGATGATGCTAAAGAGCTGCAGTTTTTGGAAAATTGCCTCCTGGTTCA 
20 ACATCATTCCCGTTTCTCTAAGGCTGTGATAAAACACTTTGACTGGGAGCATC 
TTAGAGGAGGAAAGGGTTTATTTGACITATTTTTCCAGGTCACAATTCATCAT 
TGAGGGTAGTCAGGGCAGGAACCCAAGGCAGAGGCCTGAAGGTAGGAACTA 
AGGAGGAGTGCACCTTGCTGGCTCTCTCACAGGCTCACACTCACTAGCTTTCT 
TATATAGGCTGCCAGGGAATGGTGTTATTCAGACCCCCAGGACCACCTGCCTA 
25 AGGAATGGGCTGGATCCTCCTACATCAATAATAAAGACAATCCCCCACAGAC 
ATGCCTACATGCCAAACTGGTCTGAGCAATCCCTCTATTAGATATCCTCCCCA 
GGTGATTCTAGGCTGTATCATGTTGATAATTAAAGCAAAGAAAGGAAATCAG 
AGTGTGCCCTGTGTATACACAAGACCTCAAGTATGTGCATACACACACAAAA 
AGAAGAGAAAAAAATrGTGCAACCACTTACATTAACAAGTCACAT?-.GAAAAA 
30 ATCACCAAAGGACCTCCAGTTAGCCTCGATAGCnTrTCTTACAACGAATTAAT 
GTTAATGTCTTCCTTGAAATTCCCCAATGTTCTCTGCTGAGCATTTCTGGATTT 
TTTTTTTTTTTTTTTTTGATCAGAAGAACTAACAGAAAACCA 
TGGCAGGGAGATGTTCAGATACATCAGGGGTGGCATCACCACtGGGCTCTGA 
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CTAGGGCAGTGACACAATTGTTATCTGTTCCTTGGCTCAC3AAAAGGAACTGCA 
GGGGTTATATGACCTTTCTCTAGCTTATCAATCAAATGTGTGGAAACCACCAA 
GTCGCAAACCTTTTAAGTTTGTTACAATATGACTTACCAAATCAAACTtGGAA 

CTTTTTCTACATCTGTAGCTAAATGCCTGA 

5 TAGGAATAAGACTGGAATTATGATGTGTGCATTAGTTAACTCCCTCATCCAAT 
CAATACCTTCCAAGAAGCAACTTATGGAGGAGGAGTTTATTTTAACTCAAGGT 
TTGAGATGTGCAGTCATCCTGGCAGAGAAGGCTGTGGCAGGGGGAATGCTTC 
TCAGCTCTAAGTTGTCATGTGACACACACACACACTGTAGCATGCATGCATAC 
AAGCCCATTAGACAAATAAATAAATAAATAAATAAATAAATAAATAAATAAA 

10 TAAATGTAAAGACCTGCACACACATGTCCAGAGCTCTGTGACTCTCAACAGCC 
TAGAAAACGTCCCAAGGACCTGTAAGATGAGAAACCAGAAAATGAAACAGG 
AAGCCACACAATAGCTTTGAGTTTTTGTTTTGTTTTGCTTTGTTTTTGCAAAC^ 
AGAGTATTAAATTTATTTACAAAGTACACCACAAAATTTGATTCATCCTAAAC 
AGGAATATCGGGGGACTGGTGAGATGGCTCAGTGGGTAAAAATGCTCACCAT 

1 5 gtaaacctgacaagctgagtttgatccccagatcccacAttaagatagaaga 

AGAGAACTGATTGCCCAGTGAGCCAGCAATCCTCTGACCACACACACACACA 
CACACACACACACACACACACACACAAATAAAAAGTGAATTTCTTTTTCCAA 
TGGGAAAGATGGGCTCAGTGGTTAAAGTACTGCCACACAAGCCCAAAGACCT 
GAGTTCAGATCTGAACCCTGTAAAAAGCCAGGCATGGCTTGTGTCTATAATCC 

20 CAGCACTGGGGAGGTAGAGATGGGTGGACCCCTGGAACTTGTTGGTCAGAGA 
GTCTAGCCTAATGGGTAAACTCTAGGATCAGTGAGGGATTTCTGTTTCAAAAT" 
CTATGGAGGATAGGGTTGAAGAAGACAGCTGATATTGGCTTCTTCCATACAC 
ACACACACAATCTTCAATGCTAAACAACTAAGTTTCCACCAGTTTTGGCTAAt 
ACTAGCTGAGCACACATGCCTCAGTTTCCTCATCTGTGAAAGGGAGGCTGTGG 

25 TGACATTCCCATGAGCAACACTGCATGAAGCATGTGATTCAGTGCTCATCATA 
CTTCCAAGAGCTGGAATCATCTAGACAAGTCATATCTACCTTCTTGGTACCTA 
CCCAGGTCCCAGAAACTACAGTAAGCTGCCCCATGAAGGCAAGATACTCAGT 
CAATACACCTACCTGCATCCTCAATGCCCACTGTCCTCAGAGCTCCTCTGGAT 
CCCTCC/OCAGCTAATATGTGAGTCGGGATATTCTGACTCAATGCTCTATTGT 

30 TTGGGTGGGGAAAGCACCTAGAGAGCTAACACAGCAAAACAGAAATGTTTCA 
AAGGCTGAGAGCCATCTGTCTGTCCTCCTTGGACCACCTCTCAAGCTCTTACA 
* CCTTCAATTGGAACCTTGTGTAATGACTGAGCATCTGTGTAGCCCCACCTTCC 
CTACCTCCTGAGGTTGTCCCTATACAATTGCACCTTGGTGTCCCAAATCCAGC 
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ACTATTGTGATAGGACTAGGCTGGACACCCATGTCCACTGCTTGTGCATACCT 
GGGCACCTCAGCCCCAGGTAGAGCTTTCTGAAGGCTCTGCGGCAGCTCTGTG 
AGGAGGAGCCTTGGATTTCACTGGGGAGGGGATCTCAGGGTATGTGAGAACT 
TGTGCCTCTCTGCCTGGAAGGTGCCCATCTTGGAAGGCCAGGATTAGGGGAT 
5 GATGATGGAAAGGCCTTTGGGAAGACTCCAGTCCATGGCTCAGCATCACGAG 
TCACTGTCTGCTTGTGTCATCCCTGTGTGAAGGTATTGAGGTCCCTGAGTGTC 
AGGATGGCAGCCCTGAAGCTGAGATAGAGGTAGGAGCCATAGAGTCATCCAT 
GCTCAGTGCACACGATGCTCCTGAAAGCAGAATGCCACCACGCTGAGTTCAA 
AACAAGGACAATCACTACTCAGGCAAGTGGAGGTACAGGGCAGGTATGGTCC 
10 CCGTACACTCTGAATGATGAAGGGCCCCAGCAAAGGACAGCAGCCTAGTTAG 
GTTGACACAAGCAGGGAGCCTGAGTCACACAGCAGCATCCCATCCCATAGTC 
CCCACCCAACGCTCACAGTGCTTGTTGAGCCAGAAACTCTCAACACAGCCCTC 
TATCCTCCAAATCAGGATCTAATGAGGAGAGAGGTCAGGAACCAGCTCAGCT 
TACAAAGCAGCTGAGCTGTGGCCTTTGCCCAAGTGCAGGTAGCTGATCCCAA 
15 TATACACAGACCTTTCTGGGGGTCTGCACCCCACAGGTCTGTACCTGAAAGTG 
AGGATGTGCGCACGTGTGCACACACACACACACACACACACAGAGAGAGAG 
AGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAAACACAATATGGATG 
AGGAAGCAGGCTGACTAGCATGTATGAAGTTGAATGGATCCTGTCCTGTAAT 
CAGATCCTTAGTGGGGACTGCTAATCCCAGCTCTGCTGTCCACAGGGCTAGCC 
20 TATGCTTCCAACCGTGTGGTTCTCTCTTTCTCCCTTACGTGGGTCCCAGGGATC 
AAACTCAGTCATCAGGCTTGTGCAGCAAGTGCTTCTAACTGCCATGGATCTTG 
GGTCCTGTATTCACTTTTAAATCTTCCCATGAGTGCCCCATGGACAGGGGGTC 
CAGAAGCAACTTCACCCAGTAGTGCTGGAGTCTCATCTACCACGTTTTATGGT 
CACAGTGGGGAGGGCTGTTATTGGTTTCTACCAGGGATGGGTGCATAACTCC 
25 ACACTGGACTCTCATCACAATAGACAATGTCACCATTTGGCAGTGAAGAAGC 
TGCCTCTATGAACTAGACCATCATGAAGCCAGATCCCATTAATAGCTGAGTTC 
CACAGAGGCTAAGTAACTGCTAGGCACAGAGCCCAGCCCCTCTGCTACCTGG 
CCTGAAATGATACTGGCCAGCATTCTCACAACCCAGCGGGCTGCTGGCCCCAT 
GCTC ACCG AAGG AGCTCTAG ATC CATAGGGGTAGTTTCTCCTGGGGGAAGGT 
30 GACCCAGCCCAGATATTCCCAGGCCAGGTAGAAGGGCCAGCTGCCACCTGTT 
CACACTCCAGGACCAGCTAGGGATGACAGCAGAAAGCCTGCCGCTGTCTCCA 
TGGTAACCCTAGCCAGGCCCTCACTCTTACCCTGATTTCAGGCCCATCACCAC 
ATTCTGCTTCTGAGCCCCCAGACTTTCCTGCCGAGAGGAGACCTCTCTGGCCT 
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AGGCAGGGGAATCAACCACCTCCACATGGACACAATACCCACGAAAGCAGA 

ACCCTCGACAAAGAGTCACCTCTTCTTTGAAGGCTTCTACTCTGGACCTCAGT 

CTTGACAGCTATAAAATGGGACATCTGGGCCTGAGCTTGTCTGTGTGGCCTCC 

AGCAACCACGGCCGGCCGCCACGCTGTGAATTGACATGCCTTAAGCAAGAGG 

CTCCAGAAAGATTCGAGTCCCCCTGGGCCAGCTATAAGTGGGCAGGTTCCAT 

CAGGCTTTCTTCCGTTCAGCCTCTGCACTTCCCCAAGAGGGTCTGACTTTCCA 

AAGAGGTGGCCCAACCCAGTAGCTGCAGAGCTCCAGGGGAGGGGGTGGATC 

AGGTTACAGCCAGCAGGTCACGATCTGGAGCCATCTGGTAGAGCCCAGCTCC 

CTGCAGGCTCTCAGGAATGCTGCCAGCCCTATTTTCAGGGGTGTGACCAGCCA 

TGGCGTGGTCCCAGCCTACCCAGCCAGCATCAGGATGCAAAGCAGACCAGTG 

AGAGGCTCTAGGCTACCCTGCATTCACCTGAAGCTCTGGGAAACACAGGGCT 

GCCCTGTCCTGGAAGAGAGGCTAGTAGAATCCACTCCCTCATCCTCACACAG 

GCCTGGCCAGTtTCCCCAGTGTAGGGAAATGCCTCTGGCTTGAGCCCGGCTCT 

GCTGAGGTCACCCACAGAGGATGAGGCTCCCTGGGTTGTGAAGAAAACCTGC 

TTTAGTCTTAGCTGGACAAGCCACCGGGGCAGCAGTCTGCCCAAAACAGGTG 

CTTGTGCAATGTACTCTCCTGAGCAGTGAGTGGGGACAGGGTGGGGCTGCAC 

AAAGCTACCCAAGCTGTATCAGACAAAGGGCCACCCAGTAGGCCCTAAGGCC 

ACAGCTTCTGCCACTGCCCTGGGTCCCTCACTAAGACAAGCCATAGAGGCAA 

GGGGCCCTGGTGGGGGCCAAAGTCCCTTATGGCTCATACCCTCAGATGGCCT 

GCTACATGCTGGGCAGAGAGCAGCATTTGGCTGCACAGGGAACCTCAGGCCT 

TCTATCCAGATGAGGGACTTGTGACGCTGGCTGGCCACTCAGGAGAGGGCCA 

CACAACAGCACAAGTCTAGTATTTAGGTTCAGAACACTTTCITrCCTTTGAAG 

CCTCTGGATAAAGCGATCCTATCACTGCCTGGGAGGGTGCACTGCTCTTTCTG 

TCCCTTGTGACAAAGAGAAGGTAAGAGGTGGGGTCCCAAACCCTGTATCTCT 

GGGCAAGCCTTTAGAATGGTACCTGCCCTTTAGTTTCAAGCCTACTTGTGAGC 

TACCTGCCCCACCCTGCCTAGGCTGTTCCTCAACTGCCCCCAGGCTTCCAGAT 

CCCCAGAGATCACTTGGTCCTCCTGGTATGAAGCATGTACTAGTTAGTCCCAA 

AGAAAAGCTGGCTTTCTTCAGACCACAGCTCTGCCCACGCACATCATGCTAAC 

CCCAGACTTCCCTAAGGATAGGACCTAGGACTCAGGfTCCAGAGCCCAAAGAG 

AGACAGTGCTTTGGGGCCAGGAGCCTATGGGACCCCtCCTTCAAGCATCTATC 

TATCCCTACCCCAACACCTCTCAAGGATCAAAGCTCATTCAGTCAAGAGTTTT 

CCACCTAAAATATCAGGAAACACTCCCCACCACCATGGGGGTGGGGCAGGTG 

TCTTATTAGCTTITTTTTTTTTTTTTAATC 
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TTTTTGTTTTTTTTTTTTTTTTTGTAACACAACAGGGATTCTTTAATGTCCCTTT 
CTGGCTCTTTCCATGTTAGATGACCATGTATCCTGACCCCAGGGTAAGCATCC 
TGTCACTTACCAAAGTGAGGGACACTTTTCCATGAGTGTCAAACCCAGCCCTA 
5 ACTGTTTCTTTCACATGCCCTGACCTGACCCTGAGTCCCTTGTCTGAGCACAGT 
ATTCACAGTGCCCTGCAGACAGAACCTGCGGCTTTGGGTTAATCTTGCAGCCA 
GTTCACTTTGTCAGGAGCCCCAGGTGCTGCTCATACACTGAAAAGTTGCTTAG 
CTAGGAGTATATGCTGACCAAGGATGAAGTTGATACCTCCACAGGCCCCTGC 
AGTGGCTAGTTCCTGCCCCAGCCTCCAGCTGAGGCCCTCCAGGAGACACAGT 
10 GAGAGCACACTTAGGTGCCTGTGTGAGCCTGGTACCCCTGAACTTCTGTGAGC 
CTGTCTGTGTCCTGTTCAATAGGTGCCTGGCACTTAAGCATACCCCACACATG 
ACAGCCATTTGTCCCCAACCCTGGAAGAGGCTGGCTTGCCCAAGGACACCCG 
CCCAGGGCGGTATGACTTTGAGGCCACAGGCAGCCAGCCAGGGAAGCCTTGA 
TAAGCACAGTAGAGGTGGGGCCGTCCTGGGAACCAAAGCTGGGAAGCCGCA 
15 GAGCCCCCAGTCTCCCCCCTAACTCCCCACCCCAGCCCTTGTGCCAAGGACAG 
CCAGAACATTCTCACTAGCCTGGGGAGGTGCCACACCTCAAAAATGCTAACC 
CAGGGACCTTGGTCTGGGACAGAGAGGGCAAGTTTACACTTAGCGGGAAACC 
ATCATTACTGCACAGGTTGGCAGGGACTTCTCAAAGGCCCTTTGTATACCTGT 
CTTGTGTTTTACCTCCTGGCCAAGTGAGGTGGGGATCAGGTGTCCCAGGAACA 
20 AAGCGGGGTGTCACCAGGTCACCCAAGATCATGGGATGGGCCTGGGACAGAG 
TGCCAGCCCCACTGTTCTGTCAAAATGCACACCTGGGCCCCTGGAACCCCACC 
ACCACTCCCGTGCTTTCTCTGTGGCGGTTGAGACATTTCCTGGAGGTTCCCTGT 
GGCCCTTGCATTGGTTCTGCTGCCCTTAGCTCTCTGGTGCACAGCTGTCACAC 
CGG.CAGTGACACCGGCAGTGCTACAGATTTAAAGTCGGTTGCAACCCTCACC 
25 AACTAAGACACTTCATAGCTGCTCAGCTAACTAGGAAGGATTCTGGGTGTCG 
GGTCTGGCCCAACTGCAAGGACAGAGTTTCTGCCCTCTCTTTACAGCACCCGT 
CACCCATGCCATTGGTGGAGAGTCTCCTTTGTGTTCCTGTGAACCACTGGCCA 
GTTCTTCCCTTCATTTTACCTCTCAGCTTGCTGTTACTATTGCACTACCCTGGT 
: CTTTGGCCTGCTAGGCTCTCACCTGCCAGGCCACCCACGCCCAGAGTTCTCTt 
30 GTGAGACTGGTCCACTGGTGCCCTAGAAACrCATATTTCTCTCTCATGGAAAG 
GGTTTTCTGAGGACATGGTGGGATACAACAGCTGCCTGCAGATCACAGTGAG 
AGCCTGAGAGGCTGCTCACTGCTTirGGTCACCAGGAACAGCCTTTCTAGAAA 
CCCAGAGCTCTCCCCCAGGCACTCAGTCCCAGGCAAGTGTAAATGTCAGGGG 
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AGGGGTACAAGAAAGATAGTAACCCTGACATGACACCCCXGCCCCTGACATA 

CACACCCATACCCTTGTCCATCTTTTCCTCTTCACAGGAACACCACATCCAAT 

CAACCAACACACCCACTGATCCAGACATGCATCCACCCATGCATCCTGTGATA 

GTCTAACACTCCAACTTCCTGAGCCCTAGAAGCCGGACTTGGGAGCAGGGGC 

CACATAGAAATAGCCATTTAGTATTTGTGGACTGCCACACTGCCCACCCAACA 

CACATGTGCACACATGGAAATATGGACTCCCATCTGGGCCTCTCCTACTCAGT 

GAGTGTCATGGAAGCTATCTGGACAGAGTTAGAGGCAAGCGTGTCTCAGCCC 

TACTCCAGATCAGCTTATATGGTTGCCATAGCTTATCTCCTCTAAGCTCTGGA 

GAACTGCCTCTCTACAGACCTTATTTCCAGAATATTCTCAACATTTACAAGGA 

GATGACTAGTTCATATGTCCACTTAGAAATAACTCCCTCCAATGCCAGAGACA 

CAGAGGTGTTGGGCAGTGAGAGGGTAGCCTTGGGGGCCTTTAAATGGTCATA 

CATAGCAAGATGAGGAGCCCAAACGCATGATCACATGACCTGTTAGCATCCC 

TGAGATAGAGGCCCAGACCCTATCCACATGTGGGGAAACAGCCCCAAAGCAA 

GGTGACCTCCCAGGCCACACAACCCCAGTACGAAGTCTCTGGGATGGTGCCA 

GACCCATCTCAACTGCCGCCTGCCACTACTACTTGCTCAGGAGAGGACATTCT 

GCAATCCAGCAAAGGCGAGAGGCCAGCTCTGTGTACATGAAGACTTCCTGTT 

CTCCAGTGTCACCTGAGAGACCAAGTTCCACAGCTGTACCACTGATTCCCTTC 

CACAGAGTCCCCCCAGAAAGATGGCCATTGTATCTGTCTGGTCAGAGAAACA 

CAGGCAAACCATGTTGCCTTTGCTGCCTGGGCCTGTCTTTTGTCTATCGTCTTC 

CCAGGGACTAGGTCACTTGAGGGTCACTCGCCATTTCCTGCTTTCTACTTGGC 

TGGGAGGATCTGGGTTTTATGGTTTCTCTGGAAACCTCCACTGTGTACTGCAT 

ACAGAACCCTAGAAGGTGAACCCACAACTTGTCTGGGTGCTCAGTGAATCCC 

AGCCTTCCTGGGATCAGAGAGCTTTGTGAGAGTACCAGGCAGGGCTCCTGAA 

CTATGAAGCTAGTGCTGCCAGTCATTGGTAGCCAGGTGTTCAATCTGGTCTGA 

CTCAAAGACACCAAATTTCCTGTGTGGTTTTGTACACATCCCTGGCCTTCAGG 

TTCTCACAAAGAATAACCACGCCCATCTGGCTAAAGCAATTCTAGGACCCAG 

ATGTGGTGTGTAGACTACAGCTCCCCCGCTGAGCCATGGAAGGAAGGCAAGC 

CATATTGGATTCTGTCAGCCCTGAATCACCTAGCCAGGGTACAGAACAAGAA 

CACAGGCTTTCC'iGAGACTCTCGCCTGCATAGAGGGAACTGTCTGAAAGCAG 

GCAGTCTCACTGCTCAAGGAGGGGCAGAACCATGTGGGTGACAACAGGTAGC 

AATCCACTCCAGGCTGTCCTCTAGAGGACCTGTGGAGTAAACAGCAAACTCA 

CTGTGCCTGGGCCAAGCACCATGATATTATAGCCCGTCCCCAGAGAGGGAGA 

GAGGGACTCTGTAGGAGTGACCCTACAGAACAAAAACCACCTCAAAGACTTT 
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TATGCATGCCTCATTACTTGGAAACTGGGAAGGGCCTGGCGAGTTACGAAGT 
GAAAAACTTCAGGCAGGGGGGAGGAGGAGAGAAGAGGGAGGAGGAGGAAG 
TAGGGAAGGTAAAGGGAGGTGGGGGAAGGGTTAGGGAGGGGGAGGGAAGT 
AGGAAAGGAAGGAAGAGGAGAGGAGGGGGATGGGAAGGGGAGAAGGAGGA 
5 AATGGAAGGTGGGAGGAGAAGGGGAGAAATAGAGGAAGTAAAAGCATGAC 
AGGAGAGGAGGAGTTGAAGGGGTGGGGAATCCTTGCCCACTTTCCAGACAAC 
GATCGCTGGACTTCAGGTCAGCACTGAAGGCAGCTGCAAGCCTTGGCTGTCT 
ACAGATAGGACCACAGACTTGCTGTATGGAAAGTTQAAAAACACTAGGCATT 
TCTTCCCTTTCAAGAAGTATAAGATAGAGCCAGGGGTGGTGGCACACGCGCC 
10 TTTAATCCCAGCACTCGGGAGGCAGAGGCAGGCGGATTTCTGAGTTCGAGGC 
CACCCTGGTCTACAAAOTGAGTTCCAGGACAGCCAGAGCTATACAGAGAAAC 
CCTGTCTCAAAAAAGCAAGAAGTGCAAGATAGACACCAGACCATAGGAGAT 
ATGTAAGTGGCTTCAGCCAAAGTCAGGTGGCTTCTCTGACTTAGAAATGTGAT 
CTCCACCCCCACCCCCACATTCCTGGGGGCCTCCAGCAGGGCCAGGGCAGCT 
15 GTAAGTAAGCTGTAAGCTTGTACATCTTCCAAGGTTCACCAACAGCCCTCTGG 
TTTTCCTCCCCTGACTCTTGCCCTAACATTTCTACATCCCTCCTGTCTCCTCAC 
AACTCTCTGAGCCAGGTGGGCATGCTACTGCTCCTGAGCACTGGTTAGCTCCC 
CAATGTTAGTCCTCCAGAGTTCCAGGTCACATGAACAGGGCCAATTCCAGGC 
AGCACTCAGCTCCTATGCTCATCGCCACATAACAACTTTGTAAGCACAGATAC 
20 GATGGGCCATTCTGGTTAGGTCCCTTCAAGTATTCCCAGAAGCCTGGCCTTCC 
CATCTCCCTGCCAATATTACAGGGCTGGCATGAAGGAGCACTAGAGCACTGG 
GCATAAGCCTCAGTCTATCTGCTCCATCC.CTGCATTTTCCTGGCATCCTGTCCT 
CAGGGAGCAAGGGAACTGACCACAGAAGGATCATAGCCACCTCACCCTCTCT 
CACTGGGCAGAGCCAGTTCACACAGCAGACATAGACCCAACAGGTGACACCC 
25 ACTGCAGAATGGAGCATACCCTCGAGCAAGGTAATCTCAAACTGGGTTTTGC 
AATTTGTTTAATCAGGACCCAAGAGAGGGCACATGCTACACACTCTCCCAGTC 
CTTCTACCCATCTCTCCTCCCCTGCATATTTCGGTCCTTAGGCCTACAGGTCCC 
CTCCATCTTAACTTGCACACCTCAATTCCACCATGAGCACCAAAGCATGCTCC 
TCTTGCCACAGTATTTGCCAAAAGCrTC'ATCAGGAGGCTGTACTCAGCTCATT 
30 TACTCCTCCAGTAGTGATGCCAATGTAGGTCCTGGTTGAACAATTCTACCCAA 
TACCTGCTGTCCCACCCATCTAGATTTTGGACAAGTTGGTAACTGAGACCCTC 
AAGGCCGAATCTGTGATTTCACCAGGACCCATGATGGTGTAGATCAGCCCTG 
AGGGGCTTTTGACATTAGCAGGATTTGGCAAGGAACCTCCCCCCAACCCCCC 
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ACCCAAAGCTGGGACCTCTTCAGTGCTTTGGTTTCCCCGTTACTGTTACAGTG 
GGAAGCAAGACAGGTGTATGCTGGGTGAGTTGTGGGCTATCACCTATTCATCT 
CTGAAAGAGATGGAAGGTCCTGTCCCTTAGTCACTAGGACAACTGGGGATCT 
GAGTTTCTGCAATGCATTCGGACCCACTCCAACAAGACACACGTTCAAGTCTT 
5 CTCCAGTCCTGTGCTGTGCCCTGGCATCCACTGGCCATTTCCTCACCTCCTGCT 
CCAGGAATGACTTGGGCACCTGCTGTGGCTGTAGGCTCAGCCATTCCCTAGGA 
GTTAGATCTCTAGAGTTAAGGGAATTTGGGGATAGGTAGGCTCACAGCTTCTG 
AGGAGACCTTGTTTACTGCCATTCCAGAAAACAGAAAAGTCCAATC ATAGTG 
AACGAATACTGATAACTGTCATTCATATGCCAGCCGATTCGGGATCAATGCCC 

10 TCTAACTGTTGTCATGATAACTATGCACTAGCCCTCACCCACAACTCACAAAC 
AGTAAATAGCCTCAGAGCAACGAGAGAAACAATGCCACCCCATGGCATGGAC 
TCTGGGAGCCTTGCATTTGTCCTGAAGAAGTGACTGTGTAGATGTGAGAGTCC 
ACATTTGCCTCTTGGTACACTTGCTTTGTTTGGATCCAAAGGGGCTGTTCTTTT 
TTATTTTCTAGTTGAGAATCATTCTCAGTATGGTAGTACATCTTTAATCCAAGC 

15 ACTCAGGAAACAGAGGCAGCTGACTCTCTCTGACTTTGAGGCCAGCCTGGTCT 
TACCTATCTAGAGCCAGGCAAGCCAGAGGAAGTCTTTGTCTCAAAAAACAAG 
CAAGCAAAACAAACAGCCCCCATACACATCAACACCACAACAAATTTGCAGC 
TTTTAGCAGGTCTTTTTTCCTTTCTTACAATATGTTATATTTAGACAATCCCCA 
GTAGAGGGAGCAAACAAGGATTTGATCATTTCTCCTCTCCTGGGTCTTTTAGG 

20 CAATTTTTATTTGTTTTGCTTTGTTTGAGACAATGTCTTGATATGTAGCCAGGG 
CTGGCCTCAACTTGTGGAAAGCCTCCTGCCTCAGCCTCCCACGCATTGGGATT . 
GCAGATGTGTTCCTACAAGCCTGGCGCCATGATGATGTTTTCACTAAAGACTC 
ACCCTAAACCTGGGTGTGGTCCCTTGTGGCACGTGTCCATGTGGCACTCCCCA 
CATAACTGTTTCTCCGGCCCACTAGGAAGTGATCTCAGCATGCTGCGTGACCA 

25 GTGAGCCCCATCTGACTATTTTGTGCTTTAGCCATCTGATGATGGGGACATCT 
GGTTTGAGATGCTGCATTTCCTAATATGGATTTTTAAGGCTCAGTTGCCTTGG 
CTACCAAGCTCTAAGCACATCACTTCTTGGGAGTGCTTAGGCACAAATGGACT 
TCCTCTGGGCCCCAGGTTCTCCCCTGTCAGGTGGATGGGGAATCTTGGTGAAG 
GAAACATGGGAATAAGGGTCTCTCTTTCTTGGGAAGCTGCAAy:CAACATGGA 

30 AAAAATTGTACTCTGGCCCAAGATAAAGTGGGGGGTCCCTGGGGTGCTGGAG 
GGGGTTCTCTATAAAGGTAAAACCTGAAGAAAGTTGGAGGTGGCTTGGGCCT 
CTAGGCAGCTGAGGAAATCTGACTGTGGCTTTCTTTTTTGTTTGTTTGTTTTAT 
TTTATTTTCAAGACAGGATTTTTCTGTATAGCCCTGGCTATCCTGGAACTCACT 
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CTGTAGACCAGGCTGGGCTTGAACTCAGAAATCCTCCTGCCTCTGCCTCCCAA 
GTGCTAGGATTAAAGGCATGCACCACCGCTTGCCCAGCAAAATGATACTTCA 
GACAGCTATAAGTCATCAAACGGTTACTGGGATTTGAACCAGAGTCCATAGG 
ATCCACAGTCAGTGCTCTTAATGACTGAGCCACTGCTCCAGCACTGACTGTGG 
5 CTTTCTAATTGCAAGTTCCATGAGGAGGTGTCTGGAGAGTTACTCTGAGTGGT 
TTTTCGATTTGCCCAAGAACATAGTTTAAAGAGGGTACAGAGAAAGGCATCC 
ACCAGCAAGAAGCATAGAGGACAAACCTAAAGGTCTCTCCTTATAGATGAGG 
_ — TGGGGAGA^AFAAGGTAGGAGGTAGIATIXj GG AAA GG AACAGC-CAXCAGCAG 
TGCTTCCTTGGGGCTTATGGATAGAGGCCAGGCAGAGAGAGGGTGCCTGTGG 
10 GAACATGGTACACCTGAGGTTGAGAGCAGAGCATCTCCTTGCTGGAGAACAC 
TACATCGTAAAGTCCTCTGGGCAAATCGAAGGAGCCCAAAGGAACTAGTAAT 
TCTCTAGCAGTACATCCCAAAGAAGAGAAAAAAGATAGCCTGCCTGACAAAG 
AATTCAAACTAGTGTTTTTTAAAAGAAACTTAATATGATAAAACAGATTATAG 
CAAGATAATATAAAGAAATTAGGAAATTTTTGTTTTGTGATATGAATGAGAA 
15 ATTTAGCAGAAGGCCAGATTGGGGTAGGGGATCTGGACTGAAAGAGCAAGAT 
ATATGAAGTAAAGTATACAATTAGGAGTCTTGATGCAGAACAGATCAGACAG 
AAGAGTTTCTGAGCCTGAATACAAGTCTAACAAAAAGGAAAGTCAGACCAAA 
GATATGCAAAAGGAAAGGAAAACTTCAAGACCTATAGGAGAGCAGAAAGTG . 
AGTAAACAGATACATTGTGGATGTTTCAGGATAAGGAGAGACAGGAAGAAG 
20 CAGAAATAAAATTGTTTACAAAGTAATTACTGAAAAACCTCACAATTTTTGGG 
TGAAATACAGACATTTAAGCCAGTGAAGGAAGCTCTAAGGACCCCTATAGAT 
ACAACCAAAAGCACTACAGGGATGGACAGACTGTTTAGTGGTTGAAAGCACT 
GCCTGCTCTTCTAGGGAGCCTGGGTTTGATTTCCAGTATCCACATCAGAATGT 
GCATCAGTTCTATGGATCTAGCACCCTCTTCTGGACTCTGCAGGCACTGCACA 
25 CATGTTATACACAAACACACATGCTGGCAAAATATATCCATACACATAAAGT 
AAAAAAAAATTAAAGATACTTTACAAGGAACATTATCATCAAACTACCAAAA " 
ATACAAGGCAAAGAATTCCAAACATAGTAAGAGAGAAGTGCCAAGTTACATA 
AAAGGGAATCTTCAGTATACTAACAGTGAGAAGCCTTTTAAGCCAGGAAAGA 
ATAt;<3ATGATACCACCAGAGTtTTAAAAGGAAATGAATGAATGAATGAATAC 
30 CCATCAACCAGGAATATGATACCCAGCAAAACTATCCTACAGAAAAAAAAAA 
GTCmCTAAAGCAAGCAAAACTAAAAGAACTCATCAATAGATTAGTCCTAT 
AAGAAAGGCTTCAAGTAGTGCTCAGAAAAAACAAATATTGTCTCTGGTCTTTT 
CCTAGCTCAAACTGTAAGCCAAAATAAATTCTTTCTTAAATTGCTTCTGTCAG 
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ggattatatcattgcagtgataaaaataactaacacggagactgaaagttca 
gggatgggaaacacaactcaggcaaatgaaagcaaaagcaagcagtaggca 
cacttacatcaggaaagtcaactgtcagtcaaagactaaatagagacaggga 
aggctggccaagggatggagtcaacaggtgaattcctgcacccattagatct 
caaaggagaaacattatctaacagaatgacagttaaggacaaactgtatcaa 
tggacaagtagtccaggaaaatgtcagtgcaaaaatgctggagttaaaatac 
actatagactaaaaggatctacaagttgttcacaaaataccttctcaccagca 
-caggga^a^ggc6tacagta j i ! a^aggagatgggagaccacaaagtctcacaca 
cacacacacacacacacacacacacacacacacacacacagagagagagag 
agagagagagagagagagagagttagaactggtcattgaagttaacaaaga 
tgtacgaacaaaatgcatgtttaaaacttcatagagctccatatctgaccact 
aacttgctaggactaaaatcatgatagagcctgctcatattagatacaaaag 
gacagatatgggtaactaaaaaggcagatatctctacagtgaacattacaat 
gacaagaaattgaagagtgtgcagaagatggtcatagcatgtttgtggctcc 
aaaggaaaatgtttaaatgtccatcacggcacaaagcaatgtacagattcaa 
tgcagctactaacagaatatcagtggcattttcacaggactaaaaataagtc 
cgaagattggtatggggccacagaagaacccaaatggatcaggaaattgcat 
tcccaaagagcaatgctgaagatagcataatagacaacttgaaaatatcctc 
aggctggggacgtggctcagctgacagaatgcttgcctagcatgcacaagat 
20 tctgggttcaatctgaggcagaaaggagaggctagggtttaaaacagatggg 
atagtatctattacagggtgtttagtgcacatctgtaatcccagcagttggga 
tgtagtgaagggagaatcaagagtctaaggtcatccttggctacatagtctg 
agtctgaggtcagcttacattacattagaccctgttttaaaaacaaacaatgc 
ttcatagcttgaacaaccaaaacagcatgaacctagcatagaaatagacaca 
25 atatgaggctgaagagatgacccagtagttaaatccacctgttgttcttggtt 
aggactagggttcagttctcaacaccctcatggtaactcaactgcctgtaact 
ccagttcccagagatctgaaaccctcttattacttctaaggtaattggagtga 
gcatactcaggtgcacgcgtgcatacacacacacaacacacacacacacact 
cAttcactcactcctgcta-.:agacggatggggtgggcaattttacaaatggtc 
30 tcttttccaggacacaatgcctttgcctggatctttctggaaccacatccagg 
tgtgttctgctcagttcccacaaccagacctggggactttccaagaagtagcc 
agaaagtgggcaagcaccatgtgagtgcttctacctcagatctgcctgtgctc 
agtcagggcagtagctaagacagaagccctggctttattagaacattaatgg 
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GTAGGAACCTCAGGACTATCCCCAGGCCACTGAAAAGCCCTCTGGCAGCCTT 
TGCATCTACCCACCACTTTCTCAAGCTCAGGAATGCCACTGGATAAAAATACT 
GCAGGCAGATGTCATCAGTCTGGCCTGTCGTGGGGCTTAGAGTTTTCCAGGCA 
GCAGTTGCCACAGCCCAGGCTGTGAGGGTGGGGTAAAGTAAAAGGTGAAGA 

5 AAGCCTTTGAAAGGAGCCAGACAGCTCTAGGTGGCCCTGCCTGTATGGCCTCT 
TAGCACTCAGGAACATCAAGATACTTGCTAACACCACACACACACACACACA 
CACACACACACACAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGACAGA 
CAGACAGACAGACAGACAGACAGACAGACAGACAGTTTTACCAATGGATTCT 
TTATGTGCTTGAAGGCAAGGCAAGGCAAGGGGACAAAGACAGAGGCACTGA 

1 0 GCACATATGTGCCTGGGAAGGCTTCCTGGAGGAAGCAGCTTCTGAACATTTG 
ACAGAGTGTGTATTGCGTCTGACATTTGTCATGTCAGACTCCAACTCATTCCC 
ACAGGACAAGATCCTTCCATTCAGAGACTCTACTCCCAGAAGAATAGCAATC 
CACTCCTTACTTGGTTTGATCAGCGTGTGGTTTATGAGGAGGGAGCTCCTGGA 
TTCTTGGGCATAGCCAACTGTGTGACCTTAGGCAGATAAGCTACTGTCTCCAA 

1 5 GTTCCAGGGTTCCCAGATGCACACAGCAAGTAGCCATATTTGACAGGCCCTTG 
ACAGAGCCACTGCTACAGTCCCCTCCACCTCTCAAGAGGCCTCTGGGCAGGG 
CCAGGGAACGGGGCTGGGCCACACAGCAGAAGATAACGTGTTCACAGAGGC 
AGTCCTTTCACGGGCTCTGAAGCCTCATGAAAAGTCAGGTCTTGTGATGGCCA 
CAGTCACCCTGCTGATAGAGCAACCACTGGCTCCTCTCTGAGGCTTGCAGAAA 

20 TCTGGGGCTCTGCATGCTCCAAGGGGTTATGAAAAGGGATTCCCATGAGGAA 
GAGCAAGGAGGATACATCCCTGAGTGCCTCAGGTAGGATGAGCAGAGCTTAG 
ATGGCATAATCTATTAAGTAGCACCACTTACTCTTTGGCCCTAAGGCCCTGCA 
GGGCCACAGCCTACCTCCTGCTCTACCTATTACTTCTTATGCTTCCCCACAGTG 
TCAAAAGGGCCTGCACCITCCCTCTGCAACTATTCCTTAGTTTCAGCATCCAA 

25 GTGGCCCCTGTGGCCAGCACTTAATTTCCTTCATTCTCCCTACCAGAGGCATA 
AGAGCTTCTAGAAGCTTCTCTACAACCTTTCATTGGGACAGACCACCCATCCC 
CATACCACCTGGACTCCTTTTGGTCCTGGAACATGCCCTGGTGTCTGCCCCAG 
CCTGTCAATCTCATGAAGAGGGAAATGAATCACAGCAAGGTACATAGGAGGC 
AACAGCTGGACTGGAATGTCAGAGCCAGGAGC'GACCATGGGGCTAATCAGG 

30 GTCCACTTCCTAGATAAGGGTGCAGAAGAGAGAGGGTGGGAGGAGCAGGGG 
CCCTTACACTGGCCATGACCAGAACTGTGCATCTTATTACTCAGGAGCCACTT 
GATAGAGACAGCTGCAGCTCTCCTCACTGCTAGGCAGACCTGCTGTGGATGC 
CACCCTGCACTGTGCTTTGTTAAGTAAGCACTGTTTGCAAAGGGGACACCACA 
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CCCAGGAAGCATGAGCTCTGGACACTACCTTTGCCACCACjACjCjCjLTlC'l'AAAU 

ACCAAACTCCCTGAGAAACAGAGGCCATAGCAGCTGGCCTGACAATGCCCTG 

TCCTTGTGACACCATGGGCTTCTTCCCTGCCTCAGCAGCTCCCAGTACTACCTT 

AGCCTGGCATAGGTCGCCATGCCCTCATGTCCAGGGTAGGGAGGCTCTTCCTC 

AGGTCGGAGTTCCATGGGATTGTCTCTTCCTACTACCCAGCTGGGAGCCTTGC 

CACAAGGGATGTTTATGGCCACCCAACTCAGAACAAACCAAGATGAAGTGCT 

CCACACACAGGCACACAGTGTACACAGAAATGCCAGGCTGGGTCTGAGGACA 

GGTTGAGGGGTTATGTGTACCTGGAAAGTCAAGATGCCAAGAGAGTTAGCTG 

ATGACACCAAGGGACCAGGCCCCAGAGGAGATGGCATGTGGAGAAGCTTCA 

AATTTTATCTAGGGTTGAAGAATAGGTAGAGGTGTTTATAGTAACCAGAAGG 

AAAGACTTGGTCATCTACACAGCCCTCACTGCTCTGTCCCCATCATCCTGACA 

GCCATCTTCCTAGGTCTGCTAACTAATTGCCCACTGGCCACTCCATGAGAGTC 

ATAACCAATACACTGCAGCCCCTGCAATCAAAAAGGCCAGCCCATTTGACTA 

CCAGGAGTAGCTGTAAGCTCCACTCACGATCAACTCCCTGAGGTACTAGCAA 

AAGCCACCAAAAAGTTCTCAGGGAACTGAGCAAGCCAGAACTGTGAGAGGA 

TAGAGACAGGATGATCAAGTTCAGGTTCAAAAGTGAGGGAGGTGGGTAGTAT 

CTCTTAAACCCCTCTGTTGGTCACAACCAGGGTCTCCAAGCAGCTTGACAGTC 

AGAGTAGATAGTTTGGAACATCTGCCCCCAACCCTGGCCAAGGCCCAACAGC 

AAGGCAGCTGGACCCCTGTGACCTTGGCTAGGAAGGGGGCTGCTCTCTATGG 

GAACTCAAAGAGCCACAAGATAGGATTAGGGTGAGGGGGTGGGGTGGGAAC 

CCAAGCAGCAGAGGGCAGTGCCTGGGGACCAGTGTCCTGACCCTGTCAAGCA 

GGGGAGGTCAGCACAGACACAGGAAGAGCCGCTCTACTTGGGCCAAGGCAG 

GGGTCCCCCAAGGAGGGGCCAGGATGTCCGGGTCTGTTTCCCCATACGCATG 

TGGGAAAGCATACCCCAGAGACTGCAGGACACTGGCTCTCACCCCTCCACCC 

CCTCCACCCCCACACATTCATGCCCTGCTTTTCAAAAGCAGGCTTGTCCITTG 

GGCCACAGAACCATCCCTGTATGTTTTAAGGGTCCATCTTACCAAGTGCGGAG 

CTCACAGTTTGCCAGCCACATTTGCTTCTTAGAACATCACCAGGGACAACCAG 

CATGACCACACATTTTATAAAGATGTTAGGCTCCCAACTGCTGGCCATGACTC 

TTTTCTGGAAACTGCCTGACCCAGAGGGTCCTGCCCACCTCCTGAAGG7GATG 

GAGATGAGATCCCTCCCCCTAGTGTCTGAGCAAACCACATGCTGTACCCAGC 

AGGGTCCTCCCAGCTTACTTACCCCTCCCTCTTTGTAACTAAACTCCATGGGC 

AGCCAGGGCTGGGCAAGATTCACCGACCACCACTCAGGAGCTAGCATCTCTC 

TCCCTCTCCTTGACAAAGAAGATCTTAGACCACCAATCTAGAAGCCACCAGG 
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GCTCAGTGGCTCCTGGCTCCTGGCTCCTGGCTCCTGGCTCCTGAGGAGGCTGG 

gcaaaggcaagctctgctgagcaaacttcccgagggctgaacaatccaattc 

AGGAAGAAAGCACAGTGTAAGCACAGACTTCCTCTGGGGCAGGACAAGCCCT 
CTTCCtCCACCAGGTCCAGGCCACCAGAGTCAGGAACTTAACAACGCTCATTC 

5 TAATGACCTGCATCCCCAGGAGGATTCCATGCAAGTGTCTGAAGGCTGCGCA 
CCTTGGCAAATGGGGCTCCTGCACAAAAAGGTAACAGAGTAGCGGCTTCTCC 
AGCACCTTTCATCTCAGGGATCTTAGGATCCAGCAGGCAAGTTATAGAAAGC 
_ A QC A A CiC.C A.TG TTTGCT AG AC ATTCCT AC AGTGGTT AAGGTTAAGGCCC TTTG 
GGGTCTTTATGCCCTGAGAAAAGCCCACTCCTGCTGCAAACACATGCAGTGCC 

10 TCCATCCTCGAAAGACCAGCAATGAAGCCTCAGGGGTCCTGGCACTTTGGGA 
GGTAAGGCAGGGGGTTGGGTGGCCTTAAGGACATACCAAGAATGCCAGTGGG 
GACAGGGTGTCCAGAGACTTGCCCTGGATACCCTGGGCAATCGTCTCCCTTTG 
CAAGGCAAACAGCCATCCGAATTCACTTTTGTGAGCCTGGTTCTTGCTGTGCC 
TCAGAGGCATAAACAGACCTCTGGACAACCAGTGGAGGTCCTCCCCTGCCCA 

15 GGTGGCAAGACTAGCCACAAGGAGGTCACCTAGTGGAACATCTAAAGCAATT 
AGCTGCGGGGTCGGTTGAGTAGAATCCTGAGCCAGAGGCTGGCAGGGCACCA 
AGAAACTAGGCAGCCAGGAAGTGCTGCCTCGAGGTGGCCAGGGGAGGCAGC 
AAGGAGGGCAGCCAGCCCATTCCTGGAGATGACTTTCTGTGACCCTGCCCTTG 
AAAAATGAAGTGCCCTGCCCCTCTCAGGCAGCCTGTGAGAATAAGAGCCACC 

20 • TGGTAGCAGTGAGGGCCAGGTGCAGCTCAGCTCCCTCTCCTGTTCCCTAAAGC 
AGGTGCTGTTCTCGGACAATGTCTACTGGGCTCGCTCAGGGCATCAAGCCTGC 
TTTCATCAGCCTGGGTTATTGAAGCTGTGGGCACTCCCTGTGGACATCTTGGG 
CAACACCAGTGCACCTGCTAATCTATCCTGTTGCTCTGAGTGCCTAAGACAC A 
GACACCAGCCAAGGCCTGGATGGGCTTGCACACAGACACCAGTCACTCCTGG 

25- AGGCTCCTGAGTCTCTCTTGTCACAACTGACCCACACCAAAAGGTCGGCTTCC 
AGCTTGCTTACTCCTCCTGTAGAGACTAACATCTAAGCGCTGTCCCTCTATCC 
CAAGTGAATGCCCTGGGGGATCTCCTAGGTTGGTGTACTGCAAGTGTCCCCCT 
CCACAAGCCACAGCAGAGTCCCTGAGTTTAAGGGACCGACTCTCAGTCTGGT 
TTCTGCTTTyTGTGATTAGCCCTTCCTTGCCTCCCCTAGGGTCATAGTGGCA.ee 

30 AGCCCTAAGAGTCGTGCGTGCAGGGAGGTTACTCACACGGTGAAGTGGTACA 
CGAAACACTTCCAACCCGTGGGGCGCTCGAGGAAGTTGTAGACTCGGCCCTG 
GATGTGGGTGCGCGCCAGCAGCGGGCGGCGCGCACTGTAGATGGAGACCCGC 
GGGTCAAGGCTCACCCGCGGACGTGGGCCGAGGTCTGCAGGGGCCGGGGCG 
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GGCGACACTGGGGTCGACATGGGGGGCGCGAGCCCGGGGGCTCCGGTTGGCG 

CGATGGGCGCATAGACCGTGCTGCCCTCAGGGCCACCTTCGGCCAGCTCCAG 

TGAGAAGGGACACTTCTTGACCACCGCGCTGCCCCGCCGGGCGCCTAGCAGG 

CGGCTCCAACCCGCGCGCTTCCTTTCAGCACTGGGCGGGGACGAGGCCGTGT 

CCATGGCCAGACAGGACCCCTGACCCCGCTGCTCCTCGGAGCTGAGGTGAAG 

GCAGCGCAGAGCGCAGGGCGCAGCGCGGGTAGCCACTGCTGCCAGCCCAGCC 

ACGGGGGGCCGCTGTCTGCTCCGCCTGGGGGCGGGGGCGGGGTTGGTCCCCG 

GGGATGGCATGTCCCGCCCCCAGGACGCCCCGCCCCTGGGCGCCACCGCGCG 

CCTACCTGCGGCCTTGCTCAGCCAGCGGGTTACCTGTCCGCCAAGCCAAGTCA 

GGCTATCTCCGCGTTCCGGGCTGGGCGCACCTGCTGGGCGCACGTGCTTCCCC 

GCACCCCGAGGACCGCCCGCCTGCTTGGCTCTAGGGTCCTCCAACCTGGGACC 

AGCAGATACCACCACCACGAATCACAAGCACCCTTTTTCTCGCTATCTGAACC 

TACCCTAGAGAGGGAGCGGGAGGCAGAAAAGGGCATCCCAGGCCACGCATG 

GACACTTTCCGGCACAGAATGAGGAAGTGTTAGCCCAGCGGACACTGAGCAC 

CCTTGGGTGTCCAGAAGCCTCTGTTGGCGGCGGAGAATGCGAAAATCTCTTCC 

CAGTAGTTAATCGCAGCCCATAAGACAGCGGGATGGAGGAGATGGGAACAG 

ACCAAGAGTGGTGCTCACTAAGAGTGGGGTCAAAGGCAAGTGCTAGATTGCG 

GGTTTTACAGGTAGACACCGCCCGCGGGGCTGCCTGAGCGTTTACATGTTTAC 

ACTCCCTTTAAGTTTGTACCTCTCCAGAGTGACACAGAAGGAGCTTTATGTCC 

CGTGGTGGTGTGAGGATAGTCAGAAGGTTTGTGCAGCTGGCTCTTGCTACACT 

GGCCTTTGTGTGGGATGGAAGGGCACCTGCATCCCAGGACAATTCCAAAAtC 

CCATTCTCCTGTGATCCGCCACACCCACCTCTCACACCACTCACAAGAATAAC 

TTTCAGATACCGTCCTGTTAGCTTCAGTTGTCAACTTGACACAGCCTAGGGTC 

ATCTGAGAAGGGAATCAGTAGGAGGACTGCCTAAATTGGATTAGACTGTGGG 

TGTGTCTGAGGAAGATTGTCTTGATTGTCTTAGTTGATGTGGGAAGACACAGC 

CCACTGTGGGCCGTGTCTGTTTCCTGGGCAGGTGGGCCTAAGCTATATAAGAA 

AGGTAGCTGGGCATGAGCCTGAGAACAAACAAACAAGCAAGCAGCGCTCCTC 

TGGGTTTTCTGCTCAACTTCCTTGGTTGTGAATGAGTTCTTTGATGGGAGTTGG 

TGCTATGTAGAATATTGAGCAGG<^CTGCCTCCAGGTTCCTGGTTGAGGTCCTT 

CTCTGGCCCCCCTTTGAAGATGGGCTGTAATTGGTAAGCTGAAATAAACCCTT 

TCCTCACCCAAGCTGCTCTTGGTCCTGGTGTTTATCGCAGTGATGGAAGGATA 

CGAGAACAGGTGGATCAAAGGCTTTGGTGAAAGCTGTGCAGAAAGAACAAA 

GGTGAGGGGCCATTGCTATAACCAGAGGTAGAGTGTGTGCCTGGTGTATGCA 
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AGGACCTAGGTCTGATGAGGATGTGGGGGAAGAGCACCTGCCTCAAGAAACT 
GTCACTAAGGAAAACATTTCCAACTGTGCTGTGAAATCCGGAAGCGGCAGAT 
AAAGCAGAAAGTTGCCGCAGGGGAAGTAGGAGATAGGAGACAAGGCCGTTT 
TCACCACCACAGGCAAAGGATTAATAATAATCATTCATACATAAGGACCATC 

5 TAAAGCCTGGGCAGACTGCCGCAGCCAGTGCTGTGGGAGAGTGGGCTGAAAA 
CAGGATCCGAACACAGGGAAAGAGGCATGCAGCCTTGGTGCCTCAGGGGGA 
AAACATTCTCAGCCTCACTCATAATAAGAAAATGCAAATTAAAAGTACACTC 
TGATACCAGCGGCCCACTATCTGATTGGCAAAACTCCAGATGTTTGACGGCAC 
AATCTTTGGGAGAGGCCAGGGTGGCAGGTGCTTTCCTAATCACCCGTGGGAA 

1 0 GGGCATCTTGGCAGCCTTAGCAAGAGCGTATGTGCCTTTTGACCTGGAAATCC 
CACCACTAATAGCCAACCCAAAGACAAACGAGAAAAACAGAAGATGAAGGT 
GGGCATGGTGGAGCACACCTGTAATCTGAGTACTTGAGAGGTAGGAGTTTAC 
AAGAGTATTAGGAATTCAAGGCCATCCTCACCGACATAGTAAGTTCAAGGCC 
AACCTGGGCTATGTATGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGA 

15 GAGAGAATACATAAGCATATAAACCTGTGTGCTTACTGCAACGTGGCTTATAT 
TAGCAAAAGATAAGCAGCTGAAATGATGGCTCAGTGGTTAAGAGAACTTGCC 
TCTCAAGCATGAAGACTAGTGTTAAGAATCCATCACTCATATAAATGCCAAGT 
GTGACCCTCTGCATGCTTGTGTGTGGGGCGGGGGGCAAAGACTCTGGGACTT 
CTGGCTCCAGCCTAACTGAAAAACTCAAACTCTAAGGTTGAACAAGGGACCC 

20 TGCCAAGGCAAAGAGTGATATAATAGGACACCAGAGACCCTCTGCTGCCTTC 
TGCAAATGCTCACACGCGCATGTTCACACACCTGTGAGTCCATCACACACACA 
CACACACACACACACACACACAAAGACAAACACATTGCGGTTCTGTTGGTTG 
ACTTATACTCTAACAGAGCCACACAATAAAGGTTCATGACAGTCATTATCACT 
AATATCCAAAGGAAAAAAATAATTGAACCATAGGTAATAATATTGGTAGGTT 

25 CCTACTCCAGATGACCCCCAGGCTAAAATTCTCAGTGCTCAAGTCTTTCACGT 
AAAGTAGGGGAGTCTGTGTTTAGAAACTATGTGCATCCTCCAGTGTAGCTCAA 
CCCATCTCTAGATTCCTCAAAATACCGAGCACAATGGGAATGTTAGGTGGAG 
TAGTCTACATTATATTGTTGGGGAAAGAATAAGAGCTGTGCTAGTCCAATCCA 
GAAGTAATTTGTTITTCAAATATCTTCCATCTGGTGTGGGTTGCAGAACCTGG 

30 AGATTTGCAGCACCAGCTGTCTGAACAAATCAAACATATATGTTTACATATCA 
AATAAGCAATCCTTTTGAGTCTCTTAGGAACCAAGATATAAAAGATACAGAT 
GTAAAATCAAAGAAGTAAAAACAAATCITTAATGTGAGTTTAAAATATTACC 
ATAATCTCCTCTGTTGTTTCCnTrCAAGATTACTTTACAGCTATATCCGACTAA 
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AAACTTTGGAAAGATCAGGGTGAGGATGGCCTGGTTGGTGAATTGCTTGCTT 

GGACAAGCATGAAGACTGAGTTCAGTCCCAGAACCCACATAAAAATGTGGGA 

CATATACCAGGCATAGTAGCCCGTGCCTTTAATCCCAGCACTTGGGAGACAG 

AGGCAGGTGGATCTCTGTGAGTGCAAGGTCAGC.CTCTTCTACACAGTGAGTTC 

CAGGACAGTCAGAGATGTAGAGAGACCTTGTCTCAAAAATCAAGATGGACTG 

CCACCGAGGAACAACACCAAAAATTTGTCTTCTGGTTTTCACACACATGTACA 

CACAGACACACACCCTTGTGCACCACACATACTCCACCACATACAAATATAT 

ACCCATGCTTGTGTAAACTCCACCCCCCAAACACACACACTTGGAAAAGTCTG 

AGCAAGAGCAGAATTCAAGCAGCACAGCTGCACAGAGAGACTCGAATCTGTA 

ATCCCTGAATCCAAGACGCTGAGGCAGGAGGATTGCCATGAGTTCAAGGTTT 

ACAGAGTGAGCTCCTATCTCCAAAAACAAACAAATATATCAATTGAAGCATG 

CCTAATATCCTAAAACCCAGTTTGTTCATAATAAGACTTGGAAAGTCCTCGTT 

GCCCTTGGACTGTTGCTTGGCTCTTTGTTCTGAAAATGAGAGGGGGTAGGGTA 

AGATGTGGCCTGCCTTGCCTTCTCTGCACCAGCTTCAGGGTGTGAGGAGGCCA 

TGCCTGCCAAAGGTTACACACAGAAGAACGTGTGGATAGAAGTCTGATGTTT 

TGAAACCATCAACAGGAAGAGGATGGGAGTCAGGTGCAGATTAAACCATACT 

GCCTCTGAATTAGGAATTGTTGGAACTATAGTGAATGCACGAATTATACTGAG 

CTATTTACTTTCATGTGTGTTTCCATTTTTTTTAAATATACTCATAACTGTCCA 

GGTCTGTCCTGGGGGAAGATAAGAAGAAGAATGCTCTCCTAGGACCTTTGAG 

ATGTGGCTCAATGGGTGAAGTTGCTTGCCACCAAGTCTGACAGTCCAGGTTGT 

ATTCCCTGGGATTCGTATGGAAGGAGAGAACAGACAACCCTAACTTGTCCTCT 

GACCTCCACACACATGCCATGGTACAAGCATTTACATACAACAAATAAAGGG 

GTTTTTGTTGTTGTGTGGTTTGGTTTGGTTTGGTTTGGtTGGTTTAGGGCTCTT 

GGGGATGCCTGCCCTACCCTGGACAAGGAATCTGTTCTGAAGGACACTGAGG 

CAGAGAGGAGGCTGGGCAAGGGGCTGCTATGAGGTCCTTGCCCTCTCCAAAA 

AGCTGGACAGAGAATGTCTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT 

GTGTGTGTGTGTGTGTGTGTTGGGGGGAACAGAACAGGGTGGGAACCAAAGA 

GAGGGACTCTAGGTATTCTAGGTATACTGTTGAGCAGGGAACTAAGAGGGGC 

''GGGCTCAGGGTGGGACTGAAGTTGAAGCTGAGGAAATAAAATCTAGTGGCT 

CCGAAGCAGGAAACACAAAAAGTCCTGCATATGGGTGGGTATCATGAAGAG 

ACCAAACCCATGACGACTGTGGAAGTGGGTAAGGATCCTTGAGGGGCACATC 

AGCCAAGAAGAGAAAACTCTCTTGTGGTAGAGTCTGAGTAGATTGCTTAGGC 

CATGGAGGTCTGTTACTCTGGGTATGGGACTCTGGGGTGCCACCTGCTCTCTG 
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CCGGACTCCTGCTTTATCAAGGAAGCACCTCCCACTTCCACAGTCATTGAGTG 
GGTGCACACATCCCATCAGGCTGCCTCCTGGAAGATCACCCTCTCACTACCTG 
TCTCCAAGCCTTGGAATGTTCTAGAAATCTGGTGCCTGCGGTTTTACATGCTG 
ACTCTCTAAGGACTCTTTGCCAATTCATCTGCACCCCCTTGCCCACTTACTCCT 

5 CGTTCATCTGGTGAAGTCAGAGGGTAACAATGGGGGGGGTGCGTTTGAGACT 
ATCATGACGGTAAAACTAGGGCTGATGTCTTCTATAAGGACCAGAGTCTACC 
AGCCAAGAGGCCTGTTGGAGCAGTCTGACTCCCAAAGGTTTCCCAGATGTCA 
TCTTTTTTTCTGATTAATTTCCCATAATTAATTAGTTCATTCAATCCTAAAATC 
TTGTATGCACTGCCATGGAGCCATTAGGTAGAGTTGAGCAAAAATAGTGAGC 

10 AGACTGGAGGTGCCCACTATCCTAGACAATGTTAACCATCAACATGGCACAA 
CCTAGAATTACCTGAGATACAAAACACCAATTGATGAATTGCCTAGACCAGA 
CTGGCCTTTTAGCATGTCrATTAGGGATTGTCTTGATTGTTAATAGATTTGGAA 
AGAACTAGCTCAGGGTAGTGGCACCATCCCCTGGGCAGATGGTCCTGGACTG 
TGTAAGAAAGCCTGTTAATCATGAGTCTGAGTGAACCAGCAAGCTCTGTTCCT 

15 CTGTGATTTCTGCTTCAAATGCCCTGAGTTACCTCAATGATGGACCTGGAAGG 
TGAAATAAACCTTTTTCCrCCCTTAAGTTGCTTCTGGTCAGTGTTAAATCACAG 
AAACATAAATCAAACTAGAACACTCAATTAAGGGCTCAAAGTGTCTCCACAG 
AGTCGCTTCAGTTAGCCCTGATGTGGAAACTTCTAGTCTTTGGAAAGTTAGTT 
ATGTCAAATGATGTTTTGCTGGGGCACACAGGTGAAAGGATGTTTTTGATATA 

20 GCAAACACATGAAAGGACCTTTGGTGAAGGAGTATGTATTTGGCCCCACAGA 
CAGTGGAGGAGAAGCTCTGCATCACTCTTCTTCTCTAATGACACACATGCATT 
GGTTTGCCTTATATAGCATTATTGAACTCAACTTGTGATAACGCTGCCATTGA 
GAGAAACTCGCCCAAGAGCTGCTCATGGGGTTCTTGCCACAGCTTGCTGGCTT 
CATGGCCTCTCCTTGGGCTGGTTGGTGAGCCTTGCAGTTCCTTCAGGATTGCA 

25 CTATAGCTGCTAGTTCGTGCCTGATGTCTGCCTGCCGAAAGGACTGGACTGAA 
GCTAATGGGTTTGTGTGTGGTATCTTCCTGCCTAGAGGACTTGTCTGCAGCTG 
CTGAGTCATATTTGGTGTTGGCTACAGGACAGAACGGCTGCCAAAGAAGATC 
AGGCTCACCCCCAGAGAACTGTTGCTGAGCAGCTCCACTTCCCCCACATCCTA 
ATAACTTTTCTCTTCCACTACCTCTTCTGGGTGGTAGACTAGAGGAGACGTTG 

30 AACTCTTATTAAAAGTAGGTTGCAAAGAATCTATGCCTACATAGCCTGGGGG 
AGAATTTGCTGGCAAAGCTCTCTTCATGCAGTAGTCACTCAATCTTAAACAGC 
TGGGTGGTGGAGGAGGCAGGAGGGAGGGGGTATGAGGAATGGAGCTTCCCA 
CTGTGAACCACTGATATCTCTCTGTAGCCCCTCCCCGCCACTGGTGACTATTGT 
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CTATCCITGTGGATGGAACAGGCAGCAGTAAGGCTGTACTTCCATCTCTCCTG 

CATCTTACAGGCAGATGCTGGGGTGTCCTCTCCTCATCTGGGCACATAAAGAT 

CTGTCCTCTCAATCAGCTTGCTCTCCAGCCTGGGAAGGGGCATCTAGTGCCTT 

CTCTATGCAAACACTGGGGTGGTGTTGCTATTCCCTCCCATGAAGGGACCCCA 

GAGTCGGTAGGACAAAAACAATTGGGCTTGTCCAGCATCCTTTGGCCCTGAT 

ATCTTATCCTCCCTCCCACGCACACTGTATCTGAAGTCATCTCACTAGGTGGA 

CCAAGAACCCACTGTGCTCTGCTCTGTGGGAGAAGGCTGGACAGTCATTTGTG 

AAGAAAGTTTCCTTCTGAGCCAGTCCATATGCAGAATAGAAGGAGCTAGGCA 

TGCTGTCATTCCGCCACTCCTTACCCATAGGCCACACCCACCCACACTCTCCT 

CTTCCAAGATGCCAAGCTGGTAACCGGCTGAGAATACTGAGGGTTGTGTGCA 

AGTGTGTGCAGTGTGCATGTATGTCAGACACATTCAAACAGCCTTTTGTGAGC 

TCCCTTTTGTGACTGCTCCAAGGACAGTCTCTTAAGAACACAGTGGGAAGGAT 

GTGTTCAGAGGGGAGACCAGCAGCCCAATAAGGCACCAGGAACCACCCCATG 

GGGCCACAGTTATACACAATTTAACAAAGTTGTGTGAGGGTTAACTGGTGAC 

CAGGCTGTCTCTGCAAATGATACCCTGTCATTCCCTCACCAGCACCAAGCCAG 

TGCTAGTGAGGTATCAGAACTTACTTTAGTCTTACATAGTGCTTACTTTAGTCT 

GAAGTTGAGACCTGTGGTAAAGCTGCCTAAGATCTTGCTCAGGTCTCTAGAA 

AGCCACACATACCCCTGAGGCCCCATCTAAGCATGAAAAGCACTTCTAGGCA 

GGAGGGCTGTTGGCACTTGGCTTATGCTGATAGATTCAGTTCTAGCAAATACT 

GAAGGGCCCCCTTCCAGGGAGCTACCACCATCTGCCACAGAATTCACTCCTCT 

CCTACCAAGTGGTATCCTAGGAGCAGCCCCGGGACCTCAAAGGTCTCAGACC 

TCCACCAGAGCATGAGGTAATAGCCTGCTCAAAAGCACAGCCCACTACACTC 

AGGATCCCTTAGCTGGTAGCATCCCTGACACCTCTAGCCCAACAGTACTCCAT 

CTTAGACATTTGCTAAGCCACGAAGCCTCCACTTGGTGGATAGAAATGTATTG 

GGTATTTCTAGCCAAATCCAATGCCATGGTGAGCAGTCCCAGTCAGAATTTTT 

GTCCGTATCCATGATATCAAATGGGCAACACAGCTGGAGGGCTCACCATGCC 

AAACCACCATGACCCCACACAAGGCTGAAGCCCAGAACAGGGGCAGGGCAA 

CTGAGGAACAGAAGAGAAGGCAGGAACGACTCAGTTGCTGGCAGAATTTTCT 

AGGCACCCACTGCTTCTAGTTACACTGTGG:iGAAGGCTGGAACAATAGACAG 

AAAGGCCCCAATTCAGGAGATACAGATCACATCTGTGAGATGTACAGATGGA 

GGGAGGAAGCACATGAAAACCTCACATGCCACTCATGTGCTCTGACCAGCCC 

CAGCCCTTATGTGACCCAGAGCCTGAGAAGCAGCTGCCGACATTCCCATGTGT 

GGGCATCGTGTAACAGGGAGTTGATTCACCTCTATGAGTCTAGGCACAGAGG 
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CTCCTGACTATGAGTGAATAACTGGAGAGAAATGAGTGGGTGGCTTTATAGG 
TGGGTGGGTTGCTTGGATGGTGATGGGAAGTGGGTAGTAGGTGGGAACTTGT 
ATAGATGAATGAGTGGATGAATGGATGGGTAGTCAGCTGATTGGATGGATGG 
AGGGATGGATGAATGGATGGTGTTATCTGGGTAGATAGATGTATAAATGGGT 

5 AAATGGATGAATAGCTAGTGGACAGGAGGATGAATGGATAGATGGATGGTA 
ATAATGAGATAGAGGCATGAGTTGGTAGACATAAGGATAGATGGATAATTAG 
ATGGATGATGGATGGTGACAAGTAGTAGAGTTGATACGTAGATAGAATGAAA 
ATAGCCCCTATAGGCTCATAGAAATTAGCACTATTAGGAGGTATGGTCTTGTT 
GGAGGAAGTATGTCACTGAGGGTGGGTTTTGATGTTTCAGAAGTTCAAGCCA 

1 0 AGCCC AGTGTCTCTGTCACTTCCTGATGCTGATGCCTGCTGACCCAAAT ATCC 
TTCTCCAGCATTATGGCTGCCTGCATGCCACCATGCATTCTATCATGACAATA 
ATGAACTAAACCTTTGAATCTGTAAGGAAGCCCCAATTAAACGTTTTCCTTTA 
TAAGAGTTTCCATGGTCATAGTGTCTCTTCACAGCAATAGAAACCCTAACTAA 
GTCAGATGTATAGCAAGATAGATGGATGGTAGATGAATGGGTAAATCGGTGG 

1 5 ATGGATCAGTAGGTGGACGGACGGATGGATGGATCGATTAATTGATCAGTGA 
GTGAATGGCCTGATTAGTCTTGATTGTCAACTTGACAAGATTTAGAATCGCTT 
TAGGAGAACTGTCTCTGGGCATGTCTGAGAGGATTAAGATTGGGGAGGATCC 
ACACCAGAAAGTAATCCTGGTCTAAAGAGGTCCCAAGGAAAGAGCAATGGCC 
ACTTTCCCTTGCTTCTTCCTCCTGAGCAAGTGTGTCCATCACTGCTGTTGCTGT 

20 CCTCTGTTGACTTCAGACTCCAGCTACTTTGGTTTTCCAACATGAACTGAACA 
GCAGTGATTCTCCAGGGATCTTCTAAGACTGATTGGAACCAGACTGGGCTTAC 
TGAGGTGGCTGACTG ACTTCCTGGGCTCTGC AGAC ACTGGGTCTT AGTTCTGC 
AGCATGTAGATGGTCATTGTTAGACTACTTGGTACCTATCATGTAAGCAACCC 
TATAAATCTCATTAATATGTATATATTCTATGAGTTCTGCTCCTCTGGAGAAG 

25 GCTGCTGAACTCTTAGATGGATGGATGATGATACACAGATTAATGTGTCTGGA 
GGCTTGGCTGGGTAGGGACATAGGCAGAAAGAAGGTGGTGGGTGTCAGTTAG 
ATGCCTTCATAGGAACAGGGACAGTGGCTCAGTGATAAAACATTTGCTACAC 
AAGCATACAGACAAGTTCAGAGTCCTGGAACCCAGATAAAACTATACATAGT 
GTGTATCTGTGATCTTAATACTCCTACAATAAGTTGGGAGGTGAA^jGCAAGA 

30 GAATCCCAGGAAGCACAAGTTCCAGAAAGTCTAGTGTATGTGATGGCAAAGA 

agagaccctgtctcaaacagggcaaggcaagggatgacatatggcatgcatg 

CATCTGCACGCATACACACACACACACACACACACTATAAAACAAGATGCCT 
GGGTGGATGGGTCATTGATGAATGGGTGGATGTGGTCAGTTTTCAAGATATCC 
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TGGGTGCTAACCTAAGCATACTCACCTTGGTATGTCCTCATTGAATCCTAGGA 

aatgctcatcatgatcactgtcctcattgcctgacaggacctggaacaggttc 

agggctggtagttgatgccacactgaggctggctaccggtcctgtaacggct 

ctgcataatgtagaagcatacacattctatagacagcatgtacagggggcac 

5 agatgtgtgcatggaagacaagtacaggccatgggtccaggaaaccattagt 
ttccattagaggcctggcaggaactctgagttggtccctcagcccctctggct 
ttggtctacaactgtgggatagcaccaggtccagtgtcagactaatagacag 
agctcaggcgtaaggcctgtgatgctcttgtcctttctcttcccagaccattgt 
ctccaacttagagaccacagacacgagtccaacacgttgtaaaaaatcctgg 

1 0 gccaagagggtactaaggtcatgagagcaacattggtggccttagaattgct 
gtgggaaggaaaggcccttgattccctgagtgtcaatgctggagatgcagac 
tttctgggcctgCagagtgcatctctccacagaaggcagtcagagacagaat 
tgagtctaagggcagaatgggacagggccattgagctagtgtctgatcctag 
gaggtagtttctctcccaaaagcatgctggagtgggggaggctttgagttaa 

1 5 attctgcagggtggggagacttgaaggcctgccagactttggcctcttctgct 
ccagtccaaacctaggtgtcagcctcagagatcgttcaggctacttaagaac 
actggccctcaaagaacactagccctcatggatgctagacgtggcttttgagc 
aggtgggcatggaaggcagctagtgagactgtacagtaggtggtgggaagg 
aaaactgctaccctggggacacaaactggggtgggagcagcccttgctggag 

20 acttgctctctggaacacagccttgcccaagaggaaatatcttcctggcacca 
gtcagagcaaccacaagagagacagacacagctaccagccaagccatggtg 
atcaatgtagggaagcagctgttgagggcaatttaggtatccctccatggca 
ccttgattctctgtcccctcatcacagtgctcgtaatgagagcatacgggcaa 
ttgagccttgcccaatgtctatctgcccagggcctggaatggtttggaagctg 

25 cttcccactaaatggctatctccccagaccccacttatgccttcatatggctc 
atgactatgcacatgctcaggcacacaactgcctttgggcacatgttctgtat 
actttcatattctgcacttggggttggctggggctgcccagggtccagcagaa 
ctatgggggaagtggctgttgatgggcaaatgaggaagcaaaggtcatttca 
gatga* aaagcagcactgagcaacgctgctggggtgcctgggaagtgagcta 

30 taggggatgtgccctgtgtgggtgtgggcctgggtgaaggggtccttgaaag 
acatgggagcctgtggtgacaagcctgcaggagatgctccaaggagtggtgc 
' cccaagggagcaaaagaaaaagtgtcagggcctttctgggtttgattattag 
gatcacaggggtccatgaaaggttgtgggtaaggagaaccataatatggtat 
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TGGCCTAGAACGGTTAGGGAATATTTGGTGCAGTCCTTCGA.GTGTCCAAAAG 
ATGGAGGTAAAGGGGCAGCAGCAGATGAAGGAAGGGTTCTGGAAGCACTCC 
TATTCTCCACCTAGTCCTCAGACATAGGAAGGCTAGGCTTTGGATTGCTTGGT 
GGACATGAACTTAGGCTAGATTTCTAGAATCCAGTGAACTCTGGCCTTGTTCT 

5 ATACTGAGATCCGGCCATTCCACCTAGGCCCCCGCTATGTACCCTCCTCCAGA 
GCTGTACCCCACAGCTCTCAAGCACTCAAGGCTTATAGTCCCTGCTCTTCAGG 
GCTCAAGTGGCTAGAGCTGGTCTGAGCCCCCAGCCCAGCTTGTAATTTTAGAA 
GGGCATCAAATGGGCATGGCTTATGTCAAGGTAACATGACAGGTATGCTAGG 
GGACATGCTAGCTCAGATGGTGATGTAACTGTGGCCCTCTTGCTTAGGACAAA 

10 GCCAGAGGGAGACCCCTCTCCCTGCCCCACCCAGGTCCTGTTTTTGCCTGTGA 
TCTTGGCGCCCTCTGGTGGCTAATGTTGGGTCCTTGGCTGCTGCCCAACCCCT 
ATCCTCCTACCCCTGGGGTTCATGGCAAAGTAATGAAAGCCAGATCCTGTGGT 
CTCTGGGGGAAGGGCTCACAGAAACACTGTCAAGAGCTGTCATGAGGGTCTT 
CCCCAGTTATTCACAACCGCCTGGAAGCTTGGCATGAGTATCTCGGTCTACAG 

15 GAAGAGAAACCGGGACTTCGTGTTGAAGGAGCTGGATAGATCAACAGGAAA 
AAAGCCAGTGTCACAGCCATGAGAAGCATCACGAGCAACAGCCCTGAGCCTC 
AATTTGGAGGCAGTCCTCATGCAGTCCAAGGCACTGACTGCCTGACTGGTAA 
GTAGCTAGTACTCAGCTCACTCACACCATAGGATTCTCCAGGAGTGTGGCACA 
CCACAGATCAGGGCAGACATCTCCAGACCTTCTGGGACCTCAGGAGACAGTT 

20 CATCTGCTTGCCTATCATCATTGGGACCACTTGTCAGACACCCTTTCATGGAT 
CATGTGTCCTTAAGGCAGTTCTTGTTTCCTCAACTAGAGAAACTGAGGCTTGG 
TGAGAAATGAAGGAGAATTCCCATCTGACCTCTGATCTCATGTTAGAGTACAG 
ACACCCAGGCCGTACTCATCGTCTGTGCCAGGCAGAGCCCAGGGAGGAAAGG 
ACCCATGTTTCATGTAACCCCTCTTCATGTAACAGTGGGCTCTTGGCCAGCCT 

25 GGATGTGCTCTCAGACCAGTGGCACCAATCTAGCCCTTGGTGGATGGGTGGT 
ACCCTAGGTTCACTCATGCCCAGCTTTCTAAGACATCATTGGCTGCCAGGTGC 
TGGGTGTGGCATGTGGGTGGATCGAGGGAGGAACACAGCCTGCAGTAGGTAA 
GTGCCTCTAACACTCAGCACCAGCAGCTACATGCCAGGAATCTGGAAGGAAC 
GGGCAGGTAAGCTCACACCA>tGGTCAGCTTGCATAGGAGTGGGGCCAGAAA 

30 GTCGCTCAGTCCTCTGCTGAGAAGGTAGGAAGGGCAAGCAGGAGCTATGTTC 

tagccacatttctacctcatgttctccatcagggatggcatcatgaggacaag 
tctgccatgagcaagggaaagggctacccattgttcgggagggtctgggtgg 
agggcaatccaaactgtgagcctcagacttggagattacatcagccgcagca 
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CCTTGGAAACTCAGCCCAAGCTCAGGGGCTGCAGGAGGGAGGGGTGTAGGG 

GTGGGGACCCAGCTCCTCATAGTTGTATCTACGTCCTGCTCTGTCAGCAATGT 

CAGGTCTCTTCACCAAGGGGATGAGGGGTGCCATGCTAATTCATGCATGCAG 

TGACCTTCTTAGTTTTACCCACATATCACTGTAAGCTGGCACAGAGAGTTTAG 

GGACAAGTACGTTCAGGACCCCAGGATTTGGGTGCCAGATCTGTCGTGCTCTG 

GGCTCTTCCTATCTTAGGGGCGGAGAAGTACATTTATTTCAGGCTCTTTCTGT 

GACCGTCTCAGCTAGTAAGAGCCTCCAAGGTCTCCACTGCCTTGGGATTAAGA 

ATCTGTTTTTTGTCTTTCAGAATTCAGGCATAGGATAATGATCAGGGTCATGT 

AGCTCCCCCCACCCCCGCAAAAAAACCCCACCAACAAAGGGGGGCATTTTAG 

ACCTAGACACCTGCACATGCAAATCCTATTTGCATGCCTTCCCTCTTTGTGGC 

CACATTTGCCTGGTGCTGGCACATGCCCAAATGCCCCTTCACCTCAGCCTGGT 

GGGGAGAGCAAGGC.CTAGTGCTTCCTGGGTGTTTCTTCCACCTTCCATGGCAT 

CCAGGAGTCTGGACCATCCATCCAGTCTATACTGAAGCAAGGCAAGGTGATC 

TGGTCTACAGGGTCTAATCCATTGTCCATGAGTGGGGGTGCGTTGTTGAACTC 

CATCrTGAAAGGCACTCTCTAGGGCCTAGACCTTGGAAAAGAAACCTCCAGA 

GACTCAGGAGCCCCAGAGTACTCCTCCCCCAGCCAAAATGAACCTGTCCATG 

ACTTTCCCACTGGGAACTAAGTGGGAAGAAGAGTAGGAGGTCCCATTCCCAG 

TCCAAAGACTGAGGACAAACAGCATGGCTCCCATACTTCAAGCCAGAGCACA 

CATACAGCTTGAGTTACCTCTCTTATTGCAGAGCACAAGTCCAGATGTGACTG 

GGAACTCTCTTCCCAATAGTGAGATTTACTGCTAGCAGCATGTTTTTCACATT 

GACCCTAAGACAGTAGCTTTGTCTAGAAATGGGGTCTTTGTAGATGAGATGC 

AGGGAAGGTGGGGGTCCTGTCCTAGTTATGAGGTTCTTATAGGAAGAAGAAA 

GTACAGACACACACAGGCATAAGTCCCTGACCACAGAGGTAGGGACTGAGCC 

CACATAGCTCTAAACAAAGGGATGCCTGGAGGCGCTGGGAGTTGAAAGTTGT 

CCTCAGAGCCCTGGGAAAGGCCCCATAGCCTATCCATTCCTGGATTTGGGTTT 

tTGTCCCTAACACTTCAGGAGTGCACCTCAGTTGGGTCAGGCCTCCCAGCTTT 

GGTCCGTGTTCCAGCACACAGGGACATGCCTGTGACTCTAGCCAGATTCCTAC 

GGGAAAACCATCAGCTGTTTATTCCTGTGGCTGGAGGACTTTATTCAAATCTT 

GACCTCCACAGGCCCTGGGTGTGAACTCCTGGTGT^GGTGAGGTGAATTTAC 

ATCTCCCACAACTGGGGCAGGATGGGCTGCAGGGGA'CCCAAGTAATCAAAGA 

GCTGAGCCTGGCTATAAGAACCAGATACAAGGGAGAGGGCTTGCCTGTGACC 

ACACAGTGACCCATGGTGGCTACCTTCATATAGGGCCATACACTGGAAAATC 

AGGGTGACATGACTCTCTTGGTCTGGAGGCCAAGGTGAAGCCCAGGTGTCCT 
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AGGTCTGCTTTCTGCCACTGAGATAAATACCATGGCCAAAAGCAACTTGGAT 
AGGAAAGGGTTAATTTTCACTTGCACCATGCAGTCCATCATGAAGGGAATCC 
AGGGCAGGGGAGAGGCAGGAATGTGGAGGCAGAAACGTGCAGGCAGGAACT 
GAAGCAAAGTCCACAAAGGCATGCTGCGTACTGGCTTGCACTTCATGGTTTCC 

5 TGACTTGATTGTCTTATAGAACTCAGGACAGCCTGTCCAGGGGTGGTACTGTT 
TCCAGTGAGCTGGATCTTCCCATAGCCATCATTAATCAATAAAATGTCCCTCC 
TCCCAAATGACCTTAGCTTATGTCAAGTCGACAGAAAAAACTAAGCAGTTCA 
TCGGGTAAGGGAAGCCACAAAAAACCAAACCAAAACAAAACAAAAACCCCT 
GTCATAGCGTAGACCTAGAGCAGACCCCAAGAAGCCCCACATCTTCCCAAGC 

1 0 C AG AC AAGGCAC AAAGAGGAGCCTTCCTACCTCTTCTGCCTCTGAGGGTCCTA 
CCTGCAATTCTCCCTTTCTGTCCGCACACCTGCCTCGGGTAAAGAGCATGTCC 
TACTCCACCCAATGGCTTAGCATATCCCAGTTACACCTACATCAAATGCAACA 
AGACTTAGTTTATACTGAGCTAAGTATGAGCAACTGTATGTAGCTAATGAGAC 
CCCAAGGGATGTGTTGCACTGTGACCATGGTTACAATAACCTITGCAACTTTT 

15 GTTGTCACAGAAATGATAACAGTAAGTCAAGGCATTTATCAAATACATTAGT 
GGGATAACCAGGTTAGGTAGGTTACAGCTAAGGGAGCAATTTCTGCTATAGG 
CCTCAGATGCTGTCTGATGACATTTTTGGCATTTGGGCTGGTGGAAGGAACCT 
TGAGGTTTGCTAAGAATACAGGCTAAAGGTTTTTGCCTGGAAGATGGATGTTC 
TGACAATAGAAGTTGGTAATAAATGTGTGCTGAAGACAGCCCAAGATAATTT 

20 AGCTAGAGAATCTGTAACATTTCAGCTTCCATAACTAGGACATTCTAATGAGC 
CAGTCATCTGCAGAACCTTCATCGGAGGTAAACATCAGTTCACATCTGCAAA 
GCCTGAACTTCTGTAGAAGGTCACAATGTACAGTACCAGGAGCTAGGGATCA 
GGTGTATCTTTTGAGATACACTGTCTTTGGGAGTCACACTATAAGCATGCtGG 
CTCCAGCGGTGTGTAGGGAGAACCTTGACTCAACTTCTGTCCAGGTTAGCCAC 

25 AAGAGGCTGAGAATTAGTCCCCTCATAATTTATACCATCCTATACCCCTCCTT 
CTCAGCCCCCTATGTGCACAGGTATCCCACTGTAAAAAGTACAGATTCTGAAG 
TGCTATCATGTGGGCACCCCAAAGCTGGGCACATGGGACAGCCATGAAGACC 
TGGGAAGTGGTATATAAAGTGCTCTCCCATGTCATGTCAGGTCTCTACCTACA 
GTGGAGCCAAGGGAGGGCCTTCAGTATGTTCTTCCTCCCTGGCTGGACACCrrA 

30 CTTTAGGATCCAAGGCATGGTGAAGTAGGACCACAATGAATAGGAAGGGGTT 
TTGACATTCATCAAAGAAACACAGGGGTTAGGGAAGCCAGCTGCATACCACA 
CAGGTCCCTGAAGTGAAGGGTAGGGCTGAGGGGCCTAGCCTGCCTGGGTGGG 
TGGGAGAGGACCTGAGCTCAGCCAGGGTGAGGGATGCTGCAGGCAGCCTCTT 
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CATCCTCTATTTACCACGGTGGGAATGTGGAGGGAAGAGCCTGGTGGGGTCT 
TTCTCTCCCTGCTTCCACCTGTTCTTCCCCCTGCCCTCTTCACCTGCACCCCCA 
GTGGACTTCCCTTCCTGGACTTTCTTCTTGCCTTCTGTTCCTGCTTCCCCTTCCT 
AACCTTTCCTACCCTGGACTTCCAGTTCTACCAGGTCCCTCCTCTCCTGCACTC 

5 TGTCCTCTGAACTAGATCAACTGCCTTCTCACCTAACTACCCAGGCAAAAAAC 
CCTGGGACTTTGTCTGGAGCAGTCTAGGGTATAGCTCCCATGGGTTGGGGAA 
ATGAATTCCCTACTGTCTTTGGAGGTTTCCAGGAACCCCCTGCAAAAGCCTCT 
GAGAACGGTGATGAGTGTGTTAGAATCCAGATGCCCGGAGTTTGGTTAAGTG 
TGGGAAACAGAGACAGGCAGGTGGGTTTGCTGTGCCCCCACACCCAGTGGAG 

10 AAGGAGCCTGACTTTAGTTATAGCTACTGAGAGGGAGGTCGAGCTCAGGTGG 
GGCCAGGAGATGGGCAGCTGCCTCACATGCCAGATGGGACAGGTCTTCCTTA 
TGAAGAGGAGAAGAGGGTGGCTAGTCCATCCTGTGGGTTCCATCATATATAT 
GTTTCACAGGAAAGCCACCATGCAAACAACCCAGAGCTCCTGCCCCGGCAGC 
CCCCCAGATACTGAGGATGGCTGGGAGCCCATCCTATGCAGGGGAGAGATCA 

1 5 ACTTCGGAGGGTCTGGGAAGAAGCGAGGCAAGGTAGGGGCAATACATGCCT 
GAGTGAAGTCCACCATGCCTGGAATCCCCAAGTCTTACTTCGTCCTTAAAGCC 
CAGTCCTGGGGCCTGCTCACTCTGTGTCCGTTCCTTATGAGCTGAGTGCAGAG 
ACCCTGACCAGATGGACAAAAGTCGAGAGTTGCTTCCTGCCCTGGTGATAAT 
GAGAACCTTGAGGTTCAGGACTCCTGCCTCACTCTCCACTCACAGGGGGAGG 

20 ATATATTGCAATGACTATACCACACAGAAGCCTTGCCCCAAGGAGGGAATGT 
CTAAAGCCTGCCTCCTGTGGCCTGCCCTGTTCCATAAGCCCAGCCTCCTCCCC 
CATGATCAGAGTCCCACAACCCAGTGCTGTCTACCACCTGGAACCCACCAGC 
CCCTTACCCTGCAGTTTGTGAAGGTGCCAAGCAGTGTGGCCCCCTCCGTGCTT 
TTTGAACTCCTGCTCACCGAGTGGCACCTGCCAGCCCCCAACCTGGTGGTGTC 

25 CCTGGTGGGTGAGGAACGACCTTTGGCTATGAAGTCGTGGCTTCGGGATGTCC 
TGCGCAAGGGGCTGGTGAAAGCAGCTCAGAGCACAGGTAAGGCTGGCTCCCC 
CTGCCCTCAGTGGGTGAGGTCTTCCCACATCACTTTACGACCAAGACAATCCA 
TCACAGACACACCCACTGGCTCATCTGATCGAGGCTAAGATTCCCTTCCCAGG 
TGATTCrrGGGTjGTTTCAAGTTGATGGAACTAAGCATCAGAGAGATGCTCAAA 

30 AGCAAGCATGTGCTGCCCTTGCAGGGATCTGCAGGTCTGTTCTCAGCACCCAC 
GCCAGGCAGCTGGAAGCTGCTTGTAACTCCTGCTCCAGGGAGACTCAACACC 
TCTGATCTCCTCAGGCAGCTTCACCCATGCATATACTGACATACACTTAAATA 
ATTTTACAAAGAAATTAAAACTAACCAACATAGTGAGATAACTCAATGGCTA 
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AAGACAAGAGTCCCAGTTCCCAGCACCCACACAGGAAGTTGGACATGGAGGC 
ATGTGCTTGTAGTCCCAGTGCTAGGGGAAAACAGGCAAGTCCCTGTAGCATA 
TGACAGTGGCCTTCCAGAGTTGCCCAACCAGTAAGeTTCAGGTTCATCAGAA 
ACCATCTTACAGAGACAAGGTGAATGGCACCTCTGGCCTCCACATACACCCG 

5 TCCCCTGACACAGGTGTGCTGCACACATGAACATACACCCACAAGGGGGTCA 
GAAGTTCTGAGTCTCTCTCATTGCAGAAGTGGTTATAATAAAGGAATAGTTTA 
CAGTTGGATGCAGATGAGACCTTTGTCATTTAGATCCTTGCCCAGAACAAGCT 
GCAGGACAGCATCTAGGATAAATCTTGGAAAAGGTCACTGTCTGCCACACAT 
ATATCCCCCCTGAGCCTTACTATTCCCGTCAGTCTGAAAGTGTTCCCTGTAGCT 

1 0 GGGCACGGTGGTACATACTGTATTTCCAGTTCACGGAACACATGGAATGACTT 
AAGTTGACGAATAGCCTGCATTGTAGAAACCTGTCTGGAGGAAAGAAAAAAA 
AAGTATTTTCCACTACTTGATTCCCTGGTCAGCCAGCTGTGAGGACTCCAGGA 
CCCTTCTCTCCCCCTCCCATAGGTGCCTGGATCCTGACCAGTGCCCTCCACGT 
GGGCCTGGCCCGCCATGTTGGACAAGCTGTACGTGATCACTCTCTGGCTAGCA 

1 5 CATCCACC AAGATCCGTGTAGTGGCCATCGGAATGGCCTCTCTGGATCGAATC 
CTTCACCGTCAACTTCTAGATGGTGTCCACCAAAAGGTTAGTGACAGGTTCAC 
AGGACCACAGCTGATAGGGTTTAGTGAGCTTGGGTGGGTAGAGGAAGCCATA 
GGCAGGCCGAAGAATTGTGCCACTCCGAGAGGCCCTGAAACTATTGAGAGGT 
GAGAGGTGGGAGGATGGGCTGCAGTGGGAAGATGTGGGTTCCATGGGCCCA 

20 AAACTTCCCACCAGGTGGCAGCATCTCTCAGGAATAAGAGTGGCAACCTGCT. 
GTACTCCCTATGGAAAGCCAGACCTATAACGCTCTAGGCCCAGCCTGGCTGG 
ACGGCTGTCCTCAAGTCCCAGAAAGCCCTGGCCAGGTGGCTTTGGCTCTCTGG 
TATTCATACACACCCCCCCCACCCTGTGGTCTGTGTAAGGCAGAACCAATGAG 
AAGCCCAGGAGGCCCCACTGAGGTGAAGTCTGGACCCCAGCCCAGAAGAGTC 

25' TGGAAGGTGGAGCTTGCAAAGCCCTGGCCCCAGCTGTTTCCATGGTTCACTCA 
TTCCAGGGAGGCCCCCTGGCCCAGCGGTCCAGCCTCCCTCTGCCTGAGGGGTG 
GGAACTCTGAGGCCTTGCTGAAGCCTCTCCCAGGGCCCCAGCATGGCGTGCA 
CAGCACAGCAAGGCCCAGCCCAAGCTGCTCCTGAGGTCTCTGCCTCCCCCTCT 
CTCCTTGGCACAGGAGGATACTCCCA/CCACTACCCAGCAGATGAGGGCAAC 

30 ATTCAGGGACCCCTCTGCCCCCTGGACAGCAATCTCTeCCACTTCATCCTTGT 
GGAGTCAGGCGCCCTTGGGAGTGGGAACGACGGGCTGACAGAGCTGCAGCTG 
AGCCTGGAGAAGCACATCTCTCAGCAGAGGACAGGTTATGGGGGTAAGGCTT 
GCTCTAGCTGCCCATCTACAGCATCTTGGGGATGTAGACATGGAGATGTGGC 
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ACACACCAGTAAGAACTTTATAGAGAGAGCGGGAAAAGGCCTTCCAGGAAGT 

GGGGGAGGGTCCTGCATGCCTGAGTGTACAAGGCCAGGGAAGAAGAGCTGTT 

GGTGTGGGATGGAAGAGTGACACAGCTGGGCAGTGTGACCTTGGCTAGGCAG 

CAGCTACCTCCTGGGCAGACGACAGGGCAGTTGCCACTGTTTTAAATGGTGCT 

GCTCTGGCTGCAGACTCTGATAGCCAATAGTCTTCCCAGAGGTGCAGTCCCCT 

TTACTGCCACCAGGGAGCAGAATTGCCTGAGGGAATGGGTATATCTGTGGCT 

AATGGTGGGTGTGAAAGAGAGATACTAGGACAATATTAAGGGACAAAAAAA 

TATTTATAAGCTACAATGTCATTGCTTAGAAATTTTTGGTAAAGCCAGTGCAC 

TTTAACAGTGGAAACAGATCTGTGTTTGATACCACAAAGGTCTCAATAGAAC 

CACTGCAGATTGGGTCTGATCCCACCAGCTTJTGACCGTTTGTTGGCCTGGTG 

TGTCTCTTACAATGCAGAGCAAGTCTGTCCTATGCTTATAGAGTGACTACCTA 

GCCACAGTGTATCTGCTTTTGGGCTGTGGCAGGTACAACTAGGCTTCITTTCT 

CTTAAGTTACTTTGTGTTTAATGCCAGCTCTGGACAGTGCAGATGTAAGGTGT 

GGGCCCACAGCTCAGGGATTTGATACACTATTTGGGTTTGGGTTCACCCTTGG 

GGGGGAAAAGCCACTGTGGGCTGGGGATGTAGCTCAGTTAGTAGAGTGCTTG 

CCTTTTGTGCACAATGCTCTGGGATTAATCCCCAGCACAGAATAAACCAGGCT 

CTGTGGCACATGCCTGTAACTCTAGCACTCTAGAGATGCAAGCAGGGGATCA 

CCCTAAACTATATTTCAAAATTAAGACTAGACTTGGGGCAGGGGAGATGGCT 

CAGCGGTTAAGAGCACTGACTGCTCTTCTAGAGGTCCTGAGTTCAATTCCCAG 

CAACCACATGGTGGCTCACAACCATCTGTAACAGGATCTGATGCCCTCTTCTG 

GTGTGTCTGAAGACAGCTACAGTGTACTCATATACATTAAATAAATAAATAA 

ATAAATAAAATTTGTTTTTAAAGACTAGATTAGGGGAAAAAAAAGCTTTTGC 

CAGGCCAGGCCAGGGCGTGCAGAAACATGGCACCCAAGGATGCCTGAAGTTG 

GAGTTGGAGATGGTCCTGAGTTCTCACATTCTGAGCATCAAATGGCCTCATGG 

CTGCTCTGCCCACAGGCACCAGCTGCATCCAGATACCTGTCCTTTGCCTGTTG 

GTCAATGGTGACCCCAACACCCTAGAGGTAAGGGCAGAAAGAAAAGAACTC 

CCCCTCCCCTGGCTGGAGCCCTGAAGGATATGGTAGTTTCTCAGGGCATCTAC 

AGGGTGGTGTGGAGGGCTAAGGCACCTCCTAAGCATTCATCTGGGTGTCACC 

TTCTTTGCCTCCCCCAGAGGAT1TCCAGGGCAGTGGAGCAGGCTGCCCCATGG 

CTGATCCTGGCAGGTTCTGGTGGCATTGCTGATGTACTCGCTGCCCTGGTGAG 

CCAGCCTCATCTCCTGGTGCCCCAGGTGGCTGAGAAGCAGTTCAGAGAGAAA 

TTCCCCAGCGAGTGTTTCTCTTGGGAAGCCATTGTACACTGGACAGAGCTGGT 

ATGTGCTGCTGGGGCTGCCTCTGTCCTCCCAGCCTCCCCTGGGTGGGGATGCT 
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CTTTCTGAAAGTCCTGTGGGCCTAGCCTATGACCTCTCTTCCTATATACCTCAG 
AATTATCCAGGTCCTTGTGTAGCAGAGAGACCAGACAACTAGCCTGGTAGGC 
AGTTCTGAGCTAGGTTTGTGGGTCAGCAGGGGCAAAGTGGTCCATCTGAGGA 
CTCTTCCAGGACATATTCAGATCTCAAGGCTTCTTCCTGGTCTTAGTTACAGA 

5 ACATTGCTGCACACCCCCACCTGCTCACAGTATATGACTTCGAGCAGGAGGGT 
TCGGAGGACCTGGACACTGTCATCCTCAAGGCACTTGTGAAAGGTGAGCCAA 
GTAGACCTGCCCTCAGAGGCTGCAGACTTAGCAGTGTTCACAGGGGATACCT 
GTCAGCCCTTCCCAGGTAGGAGAGATGACTTACAAATTCATGAAGTGTTCAGT 
TCAGTAGCTTATAATCTAGCCTCAGGGCATTGTAACCATCCCCAGTTCTGGAA 

10 ATTTTCTATTAACCCCACCCCCACCCCACCCCACACACATCCAATTGCCTTAG 
CAATAACTTCACCTTGTCCTCAGGTAATAGTCCCCTGTTTTCTGTCTAAGAATG 
TGGGCTTTAGGGCCAGTTTACATAAATGGAATTACAGTCCATGTTATTTGTAC 
CCAGCTTCCTTCTTGAAGCACTCTGCCCACAGACCCATCCCCTTGGAAGTCTG 
TGCCCGGAAGTCCTCCCTTTTTATGGCTGAGTTCTGATGTCTACTGTGTAGATG 

1 5 TTCCATAGGTCTGTACATTC ATCCACTGGTGGATATGTTGGATTGTCACCCAG 
GCTTGTCCATGTGGCAGGGCAGACCTGGAGCACGTGTGCACAGGCTGGAATC 
TGGGAGAGACCATGCTGGGGAAGGGATGGGGACATACCTGAGGAAGGGGCC 
ACTCTAAGAGAGCCTGGGGCATCGGGGGCTGCGGATGTGGCTCATTTGGTAA 
AATGTTTGTGTGCCATATACAAAGCCCTGGGTTCAATGTCAATGCCACATAAA 

20 CCAGATGTCATAGCTTCTACCTGTTAGACAGGAGGATTGGCTGCTCAAGCAA 
ATCGGCTACATAGTAATCTCACAGTCAGCCCAGACTATAAAGGGGCAGCTCT 
GGATTTGGTCATTCCCCTGCCTGAGATAGTGAGGCCCAGGAAGAGATACAGG 
ACCACATGTGGTTTTCAGTGAGTCACACTCCCATGGAGGCAAAAGCCTGCTG 
GTCTTCACTCTGGGGCTTGTTGGTAACTGTGGTCTCCAGTGAAAGTGTCAGGC 

25 ATTCCCATGGAAGAAGTGTGCTTGTATGAGAATGTACAGCCTTGTCCTGCTCT 
TGGGCAAGAAGATGCTGGCCAAATTGCTATATTCCCGAGAGCATGTTTCTGCC 
TCTCCAGACTATGTCCAGGGGATTCTGGGGTGTTGCCCCTGAGCACTCAGTCC 

tatgtcacttgctgctgtttagaggcagctgccaacccccatcaagtcctcgc 
agagttctagtccctcccctccagctctgtggactca7:ccccacccacagcct 
30 gcaagagccacagccaagaagcccaagactacctagatgagctcaagttagc 
agtggcctgggatcgcgtggacattgccaagagtgaaatcttcaatggggac 
gtggaatggaaggtgctctccccgctcccctggcccaacttgggaacaccag 
gcagggaatacactgaagcctgaagtagtaggatttcatgggtagcatctgg 



BMSDOCID: <WO 02054069A1_I_> 



WO 02/054069 



71 



PCT7TJS01/49808 



GTCCATCCTGTCTCAGATATTGCCCTGGAGGGGGTCAAGATGATACTAGGGA 
ACACTCAAAGTCCCAAATGGGGCAGAGGGAATCCTAACACTCCTAATGTTAA 
CTCACCAAACCTTTCTTATCCCTGGCACCTGAGGCCAGCATGCTCTGCTGACC 
CATGCCTGCCCCAGCTTCAGGGAATATGTTGGGGGTGTCCAGGGGAGGATGT 
5 TCACCCTCTCTTCCTCCCTCTTATTGTCCTGATCTTCTCTACTGGGTGAATCCA 
GTCCTGTGACTTGGAAGAGGTGATGACAGATGCCCTCGTGAGCAACAAGCCT 
GACTTTGTCCGCCTCTTTGTGGACAGCGGTGCTGACATGGCCGAGTTCTTGAC 
CTATGGGCGGCTGCAGCAGCTTTACCATTCTGTGTCCCCCAAGAGCCTCCTCT 
TTGAACTGCTGCAGCGTAAGCATGAGGAGGGTAGGCTGACACTGGCCGGCCT 

10 GGGTGCCCAGCAGGCTCGGGAGCTGCCCATTGGTCTGCCTGCCTTCTCACTCC 
ACGAGGTCTCCCGCGTACTCAAAGACTTCCTGCATGACGCCTGCCGTGGCTTC 
TACCAGGACGGGCGGAGGATGGAGGTGAGTAGAGCCAGGTTAGGCCAGGGC 
GTACTGGATAGGCTGAATGTTGATGCCCTGTTCTGCAGGAGAGAGGGCCACC 
TAAGCGGCCCGCAGGCCAGAAGTGGCTGCCAGACCTCAGTAGGAAGAGTGA 

1 5 AGACCCTTGGAGGGACCTGTTCCTCTGGGCTGTGCTGCAGAATCGTTATGAGA 
TGGCCACATACTTCTGGGCCATGGTGAGCAGCCACCTCAGTCCACGGCTTCCT 
TGAGCATCTCAAGAGGCAGACTGGGGAATGTCGGGATGGTCCGGCCACCTCC 
TAAAACCCTGTCTCTCCATCTTCCAGGGCCGGGAGGGTGTGGCTGCTGCTCTG 
GCTGCCTGCAAGATCATAAAGGAAATGTCCCACCTGGAGAAAGAGGCAGAG 

20 GTGGCCCGCACCATGCGTGAGGCCAAGTATGAGCAGCTGGCCCTGGGTGAGT 
CACAGGCAGAGGGCGTCTGCCTCTGGTCCCCTGCAGATATCTCTGATAACCAC 
GTGGCCCAGGGTAACCCCCATGGCCTCCTGTGCAGAACAGAACACAGAGCAC 
TTCCTCCTGGGATGGAAATTGTTCAGGAAGTCCTGACCGGGAGGTAGAGTCC 
CTCACAATCCCAAGGGAGAAGGGGAAGGGCCAGAGGTCAAAGGTTGGGAAG 

25 GGCTCTAAGGTCAGTGGGAGTCTCAGTGGCCTTGTAGCATGGGAGGCCTGGC 
TAAACCCTAGGAAGTTGATTCAGGGCATACACTGCAGATGGCACGGGGGGAG 
GGGGGCAGTAGGGGGAGAGGGGTTCCAGGGACTGGACATCAGCACTCCATCC 
CAGCAGCTGATTTTGTTTACAGATCTTTTCTCAGAGTGCTACGGCAACAGTGA 
GGACCGTGCCTTTGCCCTGCTGGTGCGAAGGAACCACAGCTGGAGCAGGACC 

30 ACGTGCCTGCACCTGGCCACTGAAGCTGATGCCAAGGCCTTCTTTGCCCATGA 
CGGTGTGCAAGTAAGTAGTTGACCAGCCTAAGAATGGCCAGGGGCAAAGAG 
GGTTGGCAGGGCCCAGGCTACCCACATAGGATACTGAGGTCACTCCATGAAT 
GTCCCAGCCTGAAGCTCTTAGCCCATGTTGCAGCTACACAGGGGAGGGACCA 
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GAAGAGTGGCTGAATTTCAGCTTCCTCCATCAAGAATACTTTCCTAGCTTGGT 
TCTTGGGCCCAGCAACACCTGGTCTCAGTTTGGTTGGCTGTAACTCCAATCCA 
TAATCCTGTTATTGAAAGGTTACAACAGGGCTGGGGGTGTGGCTCATTTAATG 
GTGTGTCCAGCATGCTTGAAGACCTGGGTTCAATACCCAGCACTCTATAAGGC 
5 AGGTGTAGTGGCACATGCCAGCAATCTTATCACTATGGTGGTAGAAACAGAA 
GGCTCAGGAGTTCCAGGCCATCTTTGGGTATGAAGTTGATAGTGATAGTTTTT 
CTGGGTTACAGGAGACTTCCTCAAAGAAAAAAGAAGAAAAGCAGGCAGGCT 
GCAAGACAATCAACCACATTCTCCTATAAAGAGGATGCTTCTGCCAGAGCGG 
GTGCTCCAGGAGGGGGCTTGAGAAGGGTCTGGGAAGGGTATCTCAGGCACAC 
10 ATCCCTACATGAGATAAAGTGTGGTGACAGGCAGCCACAACATATGACTTTA 
TTCCAGGCATTCCTGACCAAGATCTGGTGGGGAGACATGGCCACAGGCACAC 
CCATCCTACGGCTTCTGGGTGCCTTCACCTGCCCAGCCCTCATCTACACAAAC 
CTCATCTCCTTCAGGTAGGTGGACACAGGCGGCAGCAGGTACCAGTCTCGAT 
GTTCTCATGTCTCACGGCCTTATGTCTGATTGATTGGCTATGTGAGCCCCCTCT 
15 CCCCCATGTTGGACTTTCTTCCGTCTACGTGCTTGGGATATTCTTCTCTGTGTT 
GCAGTGAGGATGCCCCGCAGAGGATGGACCTAGAAGATCTGCAGGAGCCAG 
ACAGCTTGGATATGGAAAAGAGCTTCCTATGCAGCCGGGGTGGCCAGTAAGG 
AGTAGCTGGGGGGGGTGAAGGGGATAAAGGGGGGCAGGAAGGATGCAAGAA 
ATAGGGAAGGAGGGAAGGAACTAGAAAACAGAAAGATAGAGTGGGCCTGGA 
20 AAGATAGAGCACAGACAAAATGGAGGAGACTGAGGTAGAAGGGATAGCGCA 
GGGCTAAGAAGGTAGGGGAGGGTGACGGTTAGGAAGAATGGAGAATGCTGG 
GATGGATGGAGAGAGTCAGGGCTACCACTAGGCAGTGGCCAGCTCCTAGACA 
CACGGTAGACAGAGTCAGGCTCCCATGGAGTACCATGGTTTGGTATCTCCTCA 
CCATGGTGACCTCTGCACTCACACAGATTGGAGAAGCTAACAGAGGCACCAA 
25 GGGCTCCAGGCGATCTAGGCCCACAAGCTGCCTTCCTGCTCACACGGTGGAG 
GAAGTTCTGGGGCGCTCCTGTGACTGTGTTCCTGGGGAATGTGGTCATGTACT 
TCGCATTCCTCTTCCTGTTCACCTATGTCCTGCTGGTGGACTTCAGGCCACCAC 
CCCAGGGGCCGTCTGGATCCGAGGTTACCCTCTATTTCTGGGTGTTCACACTG 
GT^TGGAGGAAATCCGACAGGTCAGTGGCCGGCATGGCCTTCTCCTGGGCT 
30 CTGCAACTCAGGCTGTGTTCTATGTGCCTCAGGTGGGACCTGTTCCGGGGGAG 
GTAGGACTTTTTAATCCATTAGAACAAGACCACTGTGTGGCCTTTCTGTCAAC 
CCACTCTACTGAGTCCTGGTCCTGGTCACAGAGTGAGCCCCAATCCCAGATCC 
AGACCAAGCCCCAATCTTAATCAGACTAAGTCCTGATCCTAGTCACAATCCTA 
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GAGCCAACCTGAGCTGATTTCAGCACCATCTTGACCTCTGATCTCTGCTCCAG 
ATTGATCTTGATTCTGATACCCTAGTCAGAGTCTGAACCCTGACCTGAGTCAC 
TGACTAAGTCCAGACTCTGAGTTTTAGTCACAGACAGTCCTGACCCTAAGTTT 
GGATTGACCTCATTTTTTTTGTCCCCAGACTTATCCCTGAAATATCTTCAGTTG 
5 ATATCTCCATTCCTAGCCATACAGAGAGTGCTAGTCATCAATTGATTCTGATA 
AACTTGGAAACTGAGTTCTGATTCATGCAGATTCCACCCTGGTCTCAGCAAGA 
AAGACACGGTGCTGCAACCAGGCTTTCTCCCTCTGCCAGGGCTTCTTCACAGA 
TGAGGACACGCACCTGGTGAAGAAATTCACTCTGTATGTGGAAGACAACTGG 
AACAAGTGTGACATGGTGGCCATCTTCCTGTTCATTGTGGGAGTCACCTGTAG 
1 0 GTATGTGGGCAGGCTCTAGGGGTCCTACTTATCCCTCTGAAACCGAAGTGTGG 
TCCTGGAACCTGGTTTTTGGCCAGCTGACTAGCATACTTTGTGTCTTGGGGGA 
AACCTCCTGTCCTACATGGGCACCAGCAGCCTTACCCAGCCTCCGTCCCTCTC 
CAGAATGGTGCCCTCGGTGTTTGAGGCTGGCAGGACCGTTCTGGCCATTGACT 
TCATGGTGTTCACACTTCGGCTCATCCACATCTTTGCTATTCACAAGCAGTTG 
1 5 GGTCCTAAGATCATCATTGTAGAGCGAATGGTGGGTCCTTGGGACCCTGACCC 
TTGAGGGGGCTGAGGTACCGGGAACCCCACCCTGGATAGATGGGGAACCTAG 
ATTATTCAGTGGATGGTGACCCTAATCCTGTTGTGGGGGGGGGTGAGCCCTTT 
GGGATCCCAGCTCTGGATAGCTGCAGAGTCCAAAGGAAAACTGACTCATAGA 
TGGGAAGGGCACTGAGCTTTGACTCTAGATGGGGATGAGGAGTATCCAGCTT 
20 TGGATAGGTGGTAGCCCATGGGACCCTAGCCCTGGGTGAGATGAGCTCCCCA 
GAACCCTAACCTGGGTTAGGGGTGAGCCCTATTGGACCTTGACCTTGGGTGGT 
GGTGAGCCCCGTGGTGTCTCAACCCCGGGTAGCAGTGAGAATCCTGGGACCT 
CCGACCCTGGATAGGGGTAAGTACTAAAACCCCAACCAAGGTAATGGTGAAG 
ACCCTAACACCTCAGGCCTGGGAGGGGATGACTCCCCTGGGACCCTAACCCT 
25 AGGTTGAGGTAAAAACCCTAGAACrCCAACCCTAAGTGAGGGTAAACTCTCT 
GAACTCCACCCTGGGGTAGAGTGAGCCCTTCTGTGACCCTTGAGGGCAAGTTC 
CCACCTAGGCCATGCCCTGTGACCCTCCCTCTGCAGATGAAGGATGTCTTCTT 
TTTCCTCTTCTTCCTGAGCGTATGGCTTGTGGCCTATGGTGTGACCACTCAGGC 
CCTGCTGCATCCCCATGA7GGCCGTTTGGAGTGGATTTTCCGCCGTGTGCTAT 
30 ACAGGCCTTACCTGCAGATCTTTGGGCAAATCCCTCTGGATGAAATTGATGGT 
TTGTGCTGAGCCAGGAAGGGTGGGCTGATGGGCGGGTGGGCTCTAGAATATG 
TGCTCCACTGGGACTTCTGGGTCACCAGGTCCCTGGCTGAAAATGGCTGACAC 
AACAGCACCCTAAGTCTAGGCCACAGGCCTTGCATACCTCCTAGGCTGAGGT 



WO 02/054069 



74 



PCT/US01/49808 



CCAGAGTAGGGTAAGACACAGCCAGCCGTGGAGCCCTCAACCATTCTCTGGC 
TGGGAAGTAGGCCATGGAGATCCTGAATATGCAGGCCTGGCTTTCTTCTTAGG 
GAAGTCAAGTGATGCAATGGGCTTCAACTTGTGTGTGTTGCCTGTTGATGTCA 
GTGGTATGAGGCTTGTCTGTAGACATGAGTGTCCCCTCAAGCTAGGCCCTAAA 
5 CCTAGTTAGGGCTGATGGAGATTCTAGGGACTTCATGACAAGTGGACACAGC 
TCTGTACTTACTGCAGGGGTCTTGTGGGTAGTGGAGGAAGAAAAACCCTGGA 
GACAGAGTGGGGACCCAGGCAGACCTAAGGTGGCTAAGATTGGGCAAACAT 
AGAGATTCAGCTCATTAGGCCGGAAAGAGGCCTAGAGAGTGGGGGATCAGA 
ACAGAGGCTCCATATGTCCATATGTGTAGGCTTAGAGATAGGATGCAGATTC 
10 AGGCACTGGGGGAGGTGTCAGTATTCTGGCTAGTGTCAAGGGACTTAGTGTT 
ATACAGCTTGGATGGTCAATAGAGACCTGACCCCAGCCATTCTGGGGGAGGC 
CAGTAATGAGGCCAGGTCAGCAATTACAACACTGGATGAATATTTGTCAACA 
GATGACAGTGAGATTGGAGCAAGCCCAGGCCTCCAGGGACAGACGGGCTGG 
AAATTAACrrCTAATTGGGGTGCTCCCAAGCCATACCCTGGGCTGTTTTCTTCT 
15 CCCCACCACATTGCTTTTGGTGTTTGCTCAACTGAGCCATCACTTTTCTAGGCC 
ATTAGCTTAAACAGACATTGCCCGCTTCTCCCTTCCCCTGAGCCATATGGCCT 
CATGGCCTCAGTCCCTGGGGCCAGGAAGAGGAAGGTGAGTACCCCAGCCAGG. 
GCAGCTCTGGGGTCTGAGTTCCATAGTAACTATAATTCTGGTTTCTTCTCCAC 
AGGAATTGGGGAACAAGCTCAGGCCACCTGGCAGTTTAGAAGGGTAGCAGG. 
20 GTGGTCCTTAGtCTAGGTATGGAGGCCACTGCAGGGTC AGCAGAACCACTAT 
AGATGTGGCAGGCAGAGTGAGCAGTGAAATAGCATTGCCAGTCCCTACACTG 
TGTCTACTATGCTGGGAGTGGGCTGGTATTCTCTAACATGCCTGTGCTCAGAG 
GGAAACTGGGACTGGAGAGATGGCTCAGTGGTTAAGAGCACTAACTGCTCTT 
CCAGAGGTCCTGAGTTCAATTCCCAGCAACTACATAACCATTTGTGGCTCACA 
25 ACCATCTGTAGTGGGATCCGATGCCCTCTTCTGGTGTGTCTTAAGATAGCTAC 
AGTGTACTCAGAGGGAAACTGAGGCCTGGGTAGTTCTGGCCCAGTCTGCAGG 
GTAGTCCTCACCTCTCCTCCACATTTCACGCACTTCCCCCTTTTCGCACCAGGC 
TCTGGGCTTCAGGGCCAAACTTGTGTTCAACCAGGTCAGGCAGGCCTTCTCAG 

gcctcaggcaggatttatgctttttcaaggcctc1:ttgcttctaaggggcttg 
30 gccaccctctggggcacctagagaagggaagagagctggagaaacagaacc 
tggagcagattcatagggtggggaatggagtgactgctaaagcagccttgga 
gggggcaaggcaggatctggcactcccagatgactgattgcatccctccata 
tgatagaacactctacatctcacatgggttgactggtcataataactcactgg 
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GTAAACAAGGCAGGGATTTGAGAAACAAGGCAGGTCAGAAGAATGCCAGTA 
CATATACTAGACTTGAAAGTCAAATATCCCAAAATTGTACCTCAAGGCAGCCT 
CGCCTAGAGCATCACCCATAATTCTGAGAGCCGGAAGAACAGAATCAATAAT 
AACTCACTGAAGTGTACAGGTGGTACAGCTGCATGCATACATGCATATATGC 

5 CCACACCTACTTATACAGACAGTTATGTGCCAGCATACAGCCTGGAGAAGAC 
AGGGGATGGAGCCAGGAGGAGGGATAGGAATCTCAGCCTATGCTTCCCTGCA 
GGCTTGGCAGGGTAAAGCCCTGCAGGCAGGGGAGAAGCAGAAATAAAGGGG 
ATGACATTTATAAGGGGCTAGCATTGTGACTCAGAGTCCTTGGGGGATGCCC 
AGGGATCTGGGTACTCCCATTATGGGCAGAGGTACCCAGAAAGCTGCAGGTT 

1 0 TGC ATCCC ATCTGAGAGCACTAGGTGAGAAAGGCATGGAC AGGAGATCTAAA 
GGGCTCTATGGCTCTTGTTCTCACTTCCTCACATGGACTGGGCTTGCCTCCATG 
GAATCCCAGCAGAGGAGGCAGAAGATGAACTTGTTGGCTTGGTGTAACTAGG 
GCTGGTACTTTTGGTGCTTAAACTAGCTAGCCCCATGTTAACCAGGCCCCTGG 
CCACCTACCTGGCACCCACAGCCTGTCGGCCTCAGCTTGGAACCACACTCCCA 

1 5 GTCTGAAGGTGTCTGTTTGCCTTACAGAGGCTCGTGTGAACTGTTCTCTTCAC 
CCTCTGCTGCTGGAAAGCTCGGCTTCCTGCCCTAATCTCTATGCCAACTGGCT 
GGTCATTCTCCTGCTGGTTACCTTCCTGCTTGTCACTAATGTGCTGCTCATGAA 
CCTTCTGATCGCCATGTTCAGGTACCCTCCTGCTTCCACTTCTGCCCATCTGCC 
CTCGCCAATCCATGCTCCATCTTAGTAGGGCTGGTTTCAGGGTGCTGACAGAG 
. 20 ACAGGACCCATGTCCTGGTATTTTGCTGCTGTGTCCAGTGGGCTGGACCTGCA 
GTAGAGAGCCAACACTACAGGTATGGACACACTGTGAGCACACCCCTGCTGT 
TTATCCTGTTAGGAAGGAAGATGGGTCTACAGGGTTGGGGACTCAGAGCTGT 
GTAAATACCCCACTGAGGAAGCTCCACTGCAGTATACCTGAGAATGTGTTTG 
AGGGGTGCCAGACCCCCATCCAAGCCTCCTTTCTGTGGCTCTGATAAAACTAA 

25 TTGAAAACAATGTAGGACAAGAAAAGGCTTATTTCATCTTATACTCCTGGATC 
ACAGTCTGCCACTGAGGGAAGTCAGGGCAGGAGCTCAAGGCAGGAACCACA 
GAGGAGTGCTGTTTGCTAACTCCTCAGGCTCATGTTTACCTAGACTTCTTACA 
CAACTCAAGACCTCTTGCACAGGGAATGGTGCAGCCCATAGTGGACTAATCC 
CCCAGAGACACGCCCACAGGTCACTATGATCTTGACAGTCCCTCTATT^TGGT 

30 TCCCTTCTCAGGTGACTCTAGGCTGTGTCAAGTTGAGAGTTAAAGCTAACTAT 
TAGCCTCTATCTTTCTGAAAGGGCTCCCAGCACAGCCAAGCTACCAACTGGAT 
GCAATGGGGGATGACATATAAGGAGTGGGAGGTGGAGGGGTGAGATGAAAG 
ATGGGAAATGGGATACAGGGGCGTTGTGTGTGGGGTGGAAGGTAGATGCTGG 
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GTTGAGGTAGGGTGCTTTCATGGCAGGTAGAGCAGCTTATAAGGCAAGGGCT 
TGGCTATGTGTCCTGTAGCTACACATTCCAGGTGGTGCAAGGCAATGCAGAC 
ATGTTCTGGAAGTTTCAACGCTACCACCTCATCGTTGAATACCATGGAAGACC 
AGCTCTGGCCCCGCCCTTCATCCTGCTCAGCCACCTGAGCCTGGTGCTCAAGC 

5 AGGTCTTCAGGAAGGAAGCCCAGCATAAGCGACAACATCTGGGTGAGGCCCC 
AGGCTGGATGCCACTGCTCCCGAGGGGAGGGTCCCAATTGACAGAGGAGACC 
ATAGAAGGGACTGGGGTTCCTTGTCTAGGGCCTGTCCAGGTTAACCAAAGCTC 
CAGGTAGACTTGAGACTAGACAGCCACAAGGCCAGCCTGTCTCCTTGCAGCC 
TGAAATCCCTGCATAGGGCTTATTTCTGTCATCTACCTTGTGCCCTGCCTGGTG 

10 GCATCTGCCCTCAGTCTCCCTTGCATCAGCCTGGGCCTAGACTTTCCCTAGAG 
GTCCACACCACTCTCCAAAGCTTGCTGCTGTCTCCCACATTGCCAGAGAGAGA 
CTTGCCTGACCCCTTGGACCAGAAGATCATTACCTGGGAAACGGTTCAAAAG 
GAGAACTTCCTGAGTACCATGGAGAAACGGAGGAGGGACAGCGAGGGGGAG 
GTGCTGAGGAAAACGGCACACAGGTGGGGCAGCCTGTGGGACAGCAGGCAT 

15 ATGAGGCTCAGGCTGGCAGGTGTCATTCCTGTTCATGAGCTGCCCAGAGATGT 
AGAAAGGGCTCTGACAGGAGGGAGGTAAACTCAGATGCAGCAGGGCTGGGC 
TGACCTTGATCAGCTGGGTGTTATGAGGGGCAGTAAGTCTAGGAGGGCTCCC 
CATTGTGAACATGTGGAAATGGGGGGATATCTAGCCnTTCAGTAATAGGTG 
GGGTTTTCTGAACCACCTTAAGGCTGGGTGAGACCATCTAGTGCCAATAACA 

20 GGACCTCTGAGCTGGGGAAACATGGCTGGGT.CACAAGACCCAGCCCTCTTGC 
■CCTAGGAGTGTGTGTATATGTGTGGGTTGAGAGCATAAGTCCATAATTAGGC 
ATCrCTCTCCAATGGGACTCTTTTCCTCCTCTCTCAGGTCTGAAGATCCACGCC 
CAGTTGGTGCTAGATGGTGGAGGGGCTGTTCTTGAATGCTTATAGGGATGACC 
TAGTGTTCATCCTATTGCTTGGTGGCCTATTCCAAGCAGGGAGGGGATCTCAA 

25 TCTCTTTGAGGGTACAGCAGTGCTCCATATCCATAGACAGGCCCCAGATAAA 
GACTGGAAGGGCAGTAAAGCAAGAGGCCAACAGTTAACTGAGGCCCTGAGA 
AAGCAGTTAGGACAAAAAGCTCCCCCTTGTGGAGTGACATGGAAGTGTCACA 
ACTGAGGCTGACCCTGCTTTCTACAGAGTGGACTTGATTGCCAAATACATCGG 
ggggctga; i AGAGCAAGAAAAGAGG ATCAAGTGTCTGGAATC ACAGGCAAG 

30 CTAGCAACTCACTATCCATCTTTCTTCCTGGGGTGGGGGCATCCTTCTGGTTCC 
ATGGGAACTACCACAGCCCTGTACCAGCATCCTGGTGGGTAGCATTAGGGCC 
TGGATAGAGTGGCCTGCCCCTGCCCCAACTCACTTTCGAGACTCATTGTTGTT 
CTAGAGCTGGTCCAACAAAACCTAGGTTCAGCACCACATCCGGTCACTATCCC 
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TTATCATGCACTATGGGAGGACCTGGTCCAGAGTGTGGAGTCCCAGGGCAGC 

AGTAGCCCTGGCAGGAGGGCAATCACATGGGAGGAAACCAACCCGGTCCCGC 

CAGTCCCCACCGAAGTCTCAGTACCTCCCTTGGCTCTCTCCAGGCCAACTACT 

GTATGCTCCTCTTGTCCTCTATGACGGATACACTGGCTCCAGGAGGCACCTAC 

TCAAGTAAGTATGGGTTCAGGCCATGGGAATGAGATGGGTATGGGCTCCTCC 

CTTCAGCCACTAGATTAGGGCAAGAGCAAAGCTGCGGGCTGTGTATTGTATT 

AGGTCCTGCCTTCTCCACCTGTGCTAGAGGTCGGTTACCCTCGGCAGGCCACT 

GTTCCTTAGTGTCCCAAGGCTCTGCCTAGCACTGGCCTTGCTAGCCTTGCCAC 

TTTCCATGTCCTGGCCCTTGCCTCACTCCTGCCCTGCCCCAGCTGAGGCTCTCC 

AGCTGAGGGTCCTTTGTCCCTTTCTTGCTCCCAGGCTCTCAGAACTGTGGTTGC 

AGGAGTCAGCCAGCCTCTGCTAGAGACAGGGAGTACCTAGAGTCTGGCTTGC 

CACCCTCTGACACCTGAAATGGAGAAACCACTTGCTCTAGAGCCCCAGACCT 

GGCCACATCGAGTTTTTGGGGCACATCAACCTTCCCCCACTCCCAGCAGCCCC 

AAGAAATGGTCTTCAAGGCCTTGCTACAGATCACTTCTTGGACATCCCTTCCT 

AAGAGAATGAAACTCATGTCTTTGGCATCTATTCGGGAGCCTCAGAAGTATCC 

TCTCCAGCAGGGCAAGATTTTTCATGTCCCACTAAAGCTTTCACTGGCTTGGA 

CTGGACAGCTGGATCTGGCCAAGTCCTACATAGGACACCATCTGCCTGGATG 

GGGCTATTTAGGTCTAACCCCTGTCTTACCCTGAGTTCCTAAGAAGCCAACCT 

CTTAAACACTAGGTTTCTTTCTGACCCCTGACCCACTCATTAGCTGACCAGCT 

CCTAGAGGGCAGGACTCAGATCTATTGTAATTACCTCCCATCTTTCACCCCCC 

ACAGCATTATCTGTCTGATCATTCTGGCAGAAACCCCAAGATATTGCTCAAGG 

GTACCCAATGGTACTTTACTTTCTATAAAGCCTGTAGACCAGCTCAAATCAGC 

TAAACTGGGCCACAATGGTGGCTAAACGGGACATTTGAAACACCCGGGGAAT 

ATGGAGGATTGTCTGACCTAGTGAAAGGCATCTCCGTTCCTTCCACTGCTCCT 

CAAATTAAATGACCATCCAGGTCCTTTTTAGAGGAACTCAGAGAATGGGGAC 

TACAGAGGCTGGGGCAGACCTGGGTCTTAGCAGGTCTAGCTAACTTGGTCCA 

AGTCCCTGGCCTCCACAGGAAACATTCGCTCATGCATCCTCCCTGCATCCTTC 

TCTTCTTCTCTGGCTCAGCTTCAGTGGAATGACCCAGCATCAGCTGTGTCTTAC 

ACACACAAGTCCTGGAGACACA(^CCAGACACCAACCATATGCTGCTTCACAA 

CTCATCCTCTTCCTATAACCTGTGGTCTGTACAGGCCCAGTGCTGGGGCTGCA 

TTGGTTTCTGGGGGTGGGGGTGGGGGTGGGTTGTGCAGATTATGCTCATTCAC 

TACCATCAGGGGCACAAGGCTGAACACAGCTAAGAGCCCAGTCCCTCTGGGT 

AGTTTCATCAGTGACAAATGTAAATGACCATGGCAAACCTTCACGGCAGTCCT 
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CACTGGAGTGGGGAAGGCAAGGTGAGGCTGCCAGAGATGCAGTCCTGTAGTT 
GGCTGCCCAGGACTGTTTGGACAGAACTCTTTGAGTGGGGCCCGGAAAAAGC 
CACACAATTCAAGAAATATTCCAAACGTTTATTATTCATGGTAAACACTGGGG 
CACAAAGGCCTTCGAGGTCCCCTGACCCACAATTCCCACAGTCTCTGCCAGAG 
5 CGGCTGCACACATCCCCCTACCACAGACAGCCCACACCTTCCCACTTGGACTA 
GCAGGTGAGGGGTAGAGTGATAGCACTGCAGTCAAACTGAGGGTCCTCTCTC 
AGACCCCGGCCCCCCAGGGTTACCGTCCCTGTTCCGGAAGTGGTCTCTGCTCC 
GCTTCTTGCTGCCATGGAAACCAACAGCCTTCACACCTATGGGGCCTGATGAG 
GACTCAGGATGGGCAGCTTCTGAAGGTACCACCACCTCTGGCTGTCCCTGGTG 
1 0 GCCACTCC ATTCCTCAGCCTCAGGTCCAGCGAGATGGGCTAGCTCCACAGCG 
GCCTCACCCCCCGCACCTGACCCAACCCCCTTCTTTAGGACCCTCTTCCTCTCA 
GCCCTAGCCTCACTGTCCCTCACTGGTTTGGTGAAGACCACTCTTCCCTCCCC 
ACAGCTAGAGGGGTCCTGGTTGAAAGAAGGCACATAGTCTGTGTGGGACACC 
TGGCTTCGAGAGCTCAGGAAGGCCAGGGCAGCATCACGATTGCTCTGCTCAC 
15 TGGCTTCAGACACATCCTCCAGACTGTATTTGGTCCAACGCTCAGGGTGTGAC 
ACATAGTCAGGGACTGGGAGTACCCTAGTCGGGCCAGTGTTCCCATGCACTCT 
GCTCAGGCCCCTTGCTGGAGTCTGACTTGGGGGAGTCACGGGCCGCCTGAAG 
CTGCCATTGTCACTCACACTGGTTTGGGGAGCAGAGCACGGTGCCTGCCGGG 
CTGCACTCTCCAGACAATCAAAGATGCTGTGGCTGCGCTGGGAGAAGGTAGA 
20 ACTCATGCCTCGGAGGTGGAATGGCTGCACTGCAGTQGTGAGGGTGCCCGTG 
GGGGGCGATGGAGGGTCCTCAGGGCCGGAGTCCTCCTGGCCCTCCTCCGAAA 
GCCTCTCTGGGGAGAACACTTGGACCTCCACACCACTAGGTAAGCTAAGGTC 
CGAGTCCGAGTCGCTGAGGGAGACGGTGTCAGAAGGCAGGGTGTCATCCTCG 
GTGGGTTCATCAGCCTCCAGGCCAGCCTCTGTCTCCGCCATACGACCTGACAC 
25 CCGTCCCACCTGTAGCGAAGCAGTCCCAGTTCCGCCGTGAGGGGTGTATTTAA 
GGCCTCATCCCTCCCCGTCCTCCCAGGCTCCCGTCTTTCACCTCCACTATGTGT 
CAAGTGCCTCTACTCACAGCCGGACGGGAGGCCTCGCGAAGCTGGGTGGTTA 
AAGGGTATCCTAGCGCGCACAGATACAGGAGCTGCAGGTTAGGTGGGGTTAA 

ggagagagtcaaccatgagatcataacagtgtggaca::acaatagatgagca 
30 ctgggagatgctggtgacaggagatctccatcagctgcttgtcccatcacgg 
gctctcagtaaagcgcagcctctctaggtacaagcgcgacccgggtccggac 
gaagggctggcagaaacaggcacgggaatgccctggccgcggccttaggga 
ggccagtgctcaccgttccagcgtccgtacacgcctcaataccctgccgcctt 
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TCTCGCAGCGTCCCGCCAGTTCCCCGGGATTCGGTCCTGTCCAGTCTCGCTCG 
GGTTCCTGGGAAACCGAGTCTCGGGCGATCAGATTTTCGTGGCTTCAGCGCCC 
GAGAAACTACAACTCCCAGAAGGCAATGGGTTCCCGCCTAATGTCAGCCGGG 
ACTTGTAACCTTAACAAGCGTGCCAACCCGGAGCGCACACGGAAACCCACCC 
5 AGAAATGACGAAAAAGGAAAAGGCGGAGCCACGGTTCCCACAATACTCCAG 
GGCTAACTGCCAATCGCAAAGGGTTGAGGCCTGCAGCTTCGCCTGTCGGAAG 
CCGTCCCGGCCGGGTCCGACACAAGGAGGAAGTGCGTCACCACGCACTGTGA 
GCGCCAACCAGTTTCCTAGACATTTCCGGCCCTGGGCGGAAGTGGTGTGGGTC 
TTGTGCTTTGTTGGCTTTCTGGAGTGAGGCGGGGCCAAGGTCCTGGTTGGCTC 
10 TTGGCATGAAGAGACGTAGTGGTCCCTGGGGTGGGTTGGTTGGGGCTTCAGA 
GATTCGAAGCCGCTATCCTGGGCTCTGCCTGTCTGTGCTTCCTGACGCAGGAA 
CGGACACTTTAGTTGCAGGACGCCGCCCCTACTAGTCAAGGCCACACATAAT 
CCTCCTCCAGGCGTGGGGCTGTTTTCTTGTATCCTTAGGGATCAGAGTCTGAG 
ACACCATGGGGCTGCTGCATGCGCCTCTCCTTTCCTATGTTTCTGAAGCTGTG 
15 CAGCAGGCCTCCAACAGCTTTGGCCTGGTGCCCATGGACCTCATAACATAGG 
ATATAAAGGGCTCCTCAGACCAGAAACCAACCAAGCACGGAACAGGTTGCAT 
AGGCGTGGGAGGTAGAGGTGGCGGAGACCTAGAATGTCCTAGTACAAGCTTG 
GGGTTCACCGACAAATAAGCCTACTGGGAGGCTGCTAGAAGCTGGGCCATGA 
ATCTTGGCTAGGGCGTCTCTCCTCTAGTGGCCAATTAACTATGCCAGCAGCCT 
20 GGGGGAGGGGCGCAGACAGGAGACAAAGGGAAAGAGACAGTGGTGGTTGCC 
CCTGGCGTCCGGGAGGTTGCTTAGCCTGATCCTCAGTTCACAGGGGTAAGTGA 
ACTGCCATCTTGCCTTTTACTTATTGAGACACAAGCCAGAGGCACCCAGCTCC 
GTGGAATCITTTCTTGAGATGGATGGGGTAATGGGCTAACCTTTACTGTCTTG 
TCCCTGAAAAGGAGGGTGTTCTTGCAGCTTATCTCCATCATTCTGTGCTGAAT 
25 TTCCCTGCTTAGGTGGCTCCTCCTACCCGACTCAGCCTGACCTGTCATGTCCA 
ACAGATGAGTGGAAATGAGGTCAAGGCTGGCGTAAAGTCAGCATTGAGCTCC 
CAGTATAAGGAGGTACAAATGAGGGAGGACCCATATGGAGTGCTCCCAGGGT 
CCCAGCCCTCCCCACCCACTGTGTTCCACAGCCTCTATTGGACCTGCTCACCC 
iTGTGCCTGGCAAAGTTCAGAGCTGATGAGCCAGGTCTCTGACACCGCTAGCA ^ 
30 CAGGAAGAGGTCTGGCAGTCCCATTAGCATGCCTGATTTTTTTCCTCCTTTATT 
AAATGACTGTTTCAAGATTTAATTACAGTTGAAGGTGACACGTTAGAAAGAA 
TAAGTGCATTGAAGGCATAAGAGGGCTTACAAACCCAGCTCTTGAAATGGAG 
GCAGGAAAGGAACAAAGATTAAAAAGATGTGAGCAATGTGACCCACAGTGG 
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GTTGAGTGGGTGGGCATGGCCCTCTAGCTAGGCTACAAAGCCTCTGGGCAAG 
TAGCTGTAGAGGAGGTTTGGCTCAGCAGGCAGAACTCTCCAGGGGTCCCAGA 
CGTACAGGGCCCTGCAAGCCAGGGCCAGCTCCACCCCTACATCCCACATATG 
CCACCCATGTGTGATGTCAGCTCCCAAAAGGTAACAGGAAAAGTTCAGGACA 
5 AAAAAAAAAACTAAAACAAAAACAAAACAAAACAAAACAAAAAACTCAAA 
AGTAAAAAGACTAAGTGTAGCAGAGTGATACAGAAATATTATATACACAGGC 
CATGTCCCCCAAGGTGGCCTCTGGTCACTCCAGGGATCCTCTGGTGCAATGCA 
AAGGGCCTCAGTACACGGAGCTGTTCCGGATGCCACAGCACAGCACCATGCT 
CAGAATCATCTCAAAGATCTGCAGGAGGGAGGGGCATAGTCAGCACCTAAGG 
10 CCCTGGGAAGGATCCCACCCACCCACCAGAGACCCTGTGCTTACCATAATGA . 
CAGCTACCACAATGGCTGCAATTCCAATGAGGTACAGCTTCCCAGAGAAGAG 
CTCATCGATTTTCTGATGACAATCTTGCTGCAGGGATGGCAAGAGCGTGGTTA 
ACCATCAGTGGCTAAGTCACAGCGACCACCCTACTCAGTGACAGGACCTATG 
TATCACAGCTTATCTTCCACCCCAAAACAAATTTAAGACACACGGACGTGCTG 
15 CCTGCAGCCTTTTGTCACAGGGGAGTGTTTTTTTACCTCACTAGCACCCTAGG 
GTCACCAGGGTATGAGGGACCACTCAAGAGAACAGTTTTCTGGGATGTTGCA 
GGGGTTAACTCAAGCTGTGAGGACCCGGTACCCCAGTAATAGAGATGGTCTA 
TGGGACTTCTGAGGTCTTGCTCAAAGTGCTGATCTTCCTGATTGGCCACACCA 
CCCTACCGACCACCACCACCACCTCATCCCCACTCCATGCTACCTGCAGTAAG 
20 GGGGTGAGTATGTTGCCGCCTGAGGGACACAGGCTGTTCCTCAGTATGGTGG 
TAGTCAGTGTGGTCAGTGCGTTGGAGCCACAACAGTTGAGCTGCAGGGACCA 
AGAACAACACTGTGAGCTTTGGAGCTTCAGGAGACAAGGGAAAGATGAGGCA 
CACAGCACACCGTCACCCCAAAATACACTTGGGAGTTGTCCTTCTGCCCTGTT 
TGTCACCCTGCACAGGAGCCCTGTTCCTTCAAACGCCTGGGAAAATGAGGCCC 
25 ATTAGATTCCTACCTGAGGACCCACAAATCATAGAGGAGACCCTTTCCTAGGC 
CCTGTTGTGTTGGAAAGAAAGGAGACCCATGCCCTCACACACTAGGAACCCC 
ACAGCCTCCCCTCCTGCTTGAGCCTAGCTATACCGTCTCATGGAAAGTCTTCA 
CCACAGCCTTGGCATTGTTGGCATCATCATCCATCACAGCTTGCTGAAGGGCC 
TGGTCATAGAACTC::.TTCACATCCTTGGCGATCTGGGGGAGAATCATAGTCAC 
30 ATACAGTGTCGGGGCTACGCCCAAGAGCACTCAACTTCCCGATGGGCTCAGC 
CTGATGCCACCTCTTCCACTGAGCTCCCAAGACCCTGCCCCCACCCAAGCAGC 
AGGTCCAAGAAATTCAAAAAGCACCCTATAACTATGGGCTCAGAGCAACCCA 
CATCTGTTGGAGGGCAGAGGGCATATCTCCCACTCGGTGAACCTAGGCTTACC 
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TGGTCTTTGTTTACGAAGCCCCAGATGCCTGCAGCCACCTCACAGGCAAACAG 

GATCACAAGGCAGGTGAAGAACTAGGGGAAAGAAGAGGTCAGAGCCTTGAC 

CAGCAAGCCCAGGCCl'AGCCCACATGACCAACAGCTATGTGCCAAGGGGGTT 

GGCCATATCCACCAACCATGTCCTGATTAAGCATGAGCCACACAGAGCCATT 

GAGAGACAAGAGCCTGATACAGTCCTGCACTGAGATGTTCAAGTTCTCATAG 

CCTCACCTTGCATCCTACTTTGCAGGATGTGGCTGGTGTAGAAATGCAGTTGA 

CAAAGTGTCCACAGCCTTCACTGAGGCACATGCAGCAAGACACTCCCCAACC 

CTAACCTCAGTTTCCTCACCTCCTTTATCCTACATTCTATGACGTGTGCTGCCT 

TTCCCCAGCTTCTGATACTGGCAGCGCCTCCCTCCCAGGCTGGCCAGATTCTG 

GCAAATCTGGGATTCAGCTTCACCAGCAGCTAGTGTGGCCAGGAGCACTAAG 

GTTGGCTTTAGGGGTGAGACACCCTTCTGGAGTTTATGCTTTTTTAAACAGAA 

acccccttttccaacatgtatAttcatactcagattcaatggaaatcaagccc 

CTAGGAAAACCAAAACCTTGGGGAAGACATATAAGTTGCTAGGAACCCTGCT 

ccccagcccttcagccctttctacttaccgtccccagcagacactgggactcc 

TGGATGGCCCCATAGCACCCCAGGAAGCCTACAAACATCATCACAGCTCCCA 

CAGCAATGAGAATGTAGATGCCTGGGGAAGGGAAGACCTGAGTGAGCAGGC 

TGCATGAATCACACAGGAAGGCCTAACTCACAACCACAGCAGGCAGAGTTTA 

GCAATGTGCGCCGTTTATGCCTCCGTGTGCTGCTGGAGGAGCTGCCACTGCCA 

CCAGGGGACAAAGCAGCAATTCAGTGACCAGAGAACATGGGTGGTCTCCATT 

TCACAGGTATGCTCTGCTCTGGAGCCAGTCTCCCACCACCTGCATCTGAGAAC 

AGGAACCCTCAGCACGCACTGTAAAGGGCTGAGACGGACTATCCGTTTCAGT 

TAGGGGAAGTGGCTTGGGTGTCAGGAGCCAATGCCCAAGACTCCCAATACTC 

GGTGAGTGTGAAATCAAGGGTAAGCTATGGCAGTAACCAACTCCAGGCACCT 

CATCTCAGGAACACTGGTGTATAACTGTTTGGCTCACCTAACTCTAAGGACGG 

GATCCTGTGTGTGGGTGGATGGCTAAGAGAATGGTAGGGGAAGCTTAAGCAG 

AGCACCCAATGTCTGTCACAGTTTACAGGGGAGCACTGTAGCCCATGAGGTA. 

CCAACTGGGCCATGTCCCTGCCTCAGTGCCCACAATACCCCCTTCTGCCCCTG 

GGAAAGCCCACTGTGTCTTTGATGAGCGTTCCATCTCTTGGTCCCTCGGCAGA 

AGCCGTCCATCCCCCAGTCTCCACGGCAA'.;ACTGTAGGGCAGTCCAAGCTTCC 

ttgcacaaagtcaggcccattgttccctgcctggccagaccagaggagaagg 
caccatggcctggacctgctgccagacttaaaaataggccaaatcctttggcc 

TCACTAAACACGTCTGGTCCGGGGAAGGCCTCCCACCAGGGCTGCCTAGAAA 
CTGTTTTCCCCCTTGCCTCCCAACTGGCAGAGCACCCCAGTTGTCAGGCCCTA 
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AAACCCCAACCTCACTCTGTCTCAACAGGAGGCTCTGTCTCAACAGGAGGAT 
ACATACAGGGCACCCAACACACAGGCACTGTCCCAGGACTTTCTAGGCCCTA 
GGCACCCTCCTATGTCTAGCACATGTCTCATCTGAAGCCCCTGTGGTGTTGTC 
TGGGATGTTTAGCAAGCTCTGAATGAACTCTCCTAAGCACTAAATAAGACAC 
5 CTTTTTCAGGGCCACATCTGCTCCTAGAGGGACTCAGTACAGGGGCAGCATG 
GGGCTCCAGAAAGCCAATTCAGATTGGCCAGGGACATGGCAGCCAGAGTAGC 
TTCTACCCATAGGTAGAAGAACCTGCTACAGGGTCCAGTCCCTAAGACTATTT 
CCCCAGTACCCAGGAGGCCATAACCAGACAGGCAGGTGGCTGAGCCGCTCTG 
GAACTGGAGCCTGGTATAGACTATTATGAGAACATTAGGATAGAGACTTGCT 
10 TCCTAACCACCCTAT.CTGGGCTCTGCAGGGTAGCCAGCCTCTCCTGCCATACC 
TGTGCAGAGTGGGAGCACAGCCTTCCCCTTGTGGAGGGCCCACCGGCCAGTT 
CTACCCCTGTCTGTCTCTGATTCCCAGATTTCCCTTGTGAAGATGCCTACCAGT 
TGCCACAATCCAGTTTCTATGAGCTGGAGGGCATCCTCCTTCTCTGAAGGTAC 
CAGATAGAAGGAGGGATAGCTAGTGTGGGCCCTTCCCTACTAACAACCAAAC 
15 CTGAAAACTGCTACCCCTCATGGCTTTCCTCAAAGGAGACCAGGTTCTGCCCC 
TGCCCATATTGCCTGGACTCCACTGCTGTCCATTTTACCCATGCTAAGGCTGT 
GGACAGAGTATAGGGAGGAAGGAAACAGAGCGAGCCATGGGGTAACCAAAA 
GCCCTGTCTGCTCTGAGGCAATTGGGCAGAAAGGGAGAATATTTCAGACCTC 
CTGGTCCCCCACTCAGCCTCAAGCATGCCTCCCAAGCCTCTTCTAGGCTCCAC 
20 AGGAGGAGAAGGAAGATCCACATCAGAGCCCTGATGAAGGTAGGAGCTTTG 
GAAGGGTAAAGACATCTCCAAATACTTCTTCAGTTCTCAGGACACCAGGTCCC 
TCTTCCAGGGAGGAAAGAGGAAAGTTTTCTCCTTCAGCACAGTGTAGTGGAC 
CAGAGCATAGGACCACACAGCACAACAGCCATCATCTTACTTAGGAGGGAGG 
GTGTTCAGCCCCTACAGGCAGCCAAATAAGGCAGGTGTAACAGTCTGGGCTG 
25 TGTGCTGGTGGGGGTGGGGTGGGGTGGGGTGTCTGAGTCATTCACTGTCGGC 
CCAGTCTCCAGCCCTTCCCAACTCAGGACTCCTGGAGCCTCAGCTGTTGCTAG 
CCCAGGCATCCTGCCCCACCCTGCCCAGGGAGGTTCACTAGAAATGCCCATGT 
TAAAATAAGCAAGGTCCTACAAGGCCCCTGCCCCACCCCGTTGCTTCCTTTCT 
GCCTAGTCTAGTCTCAGATCTGTGGGGTCACCAGCAGCACCACA^CAACTC 
30 ATTCTCCTCTTATAGGCAGCCTCCCTACCCAGCTCATACTAGGCCTAGGCCAC 
TCCCAGCTCTAACGCCTCACCCACCAGATTTGGACCATGAGCCTCTTCTTGGA 
AGCCCTCCCTGACTGTTCTGCACTGTGACTGGAACCCGTCTCCTTGGTACTCA 
CCCACGTAGAAGGTGTTGGGTGCCGGTTTGTTTCCCAGTTCCAGGTACAGCAG 
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GCTGGTGGTCTGTGGATCATGACGCAACCACAGAGCTACACCTAGGATCACG 

CCTCCAGCCAGCTAGAAAAAAAAAAAAAAAAGAAAGAAAGAAATGAAAACC 

AGGAACAGTGTCATATAGAGGCCAGAGTACTGGGGCCAATGGAAGCCTACAA 

TGCTTATCATGTAGAGCCCTCACTTCCAGTTGTGGAGCCAGATGCCAAGCAGT 

TGTCTGAGCCACTGCCACTGAGCCCTGGGTCCCACCAGCTCCTCTTGGTTGTA 

ACTCCAGAACAGAATTCAAGTGTACCTCCAATAGCGCAAGACACATACAAGA 

AGTTTGGTCTGAAAAATATGGCTCAGTTGATAAGAGTACTTGGGCATGTAGG 

GAGCCCTGGGTTCAATCCCCAGTACCACGTAGGGTTTGGTGGCAATTGTTTGT 

CATCCCAGCACTTGAGAGGTAGAGGCAGGAGGATCAGGAATTCAAGGTCATG 

TGAGTTTGAGACTAGCCTGGGCTATATGAAACCTTGTCTCCAAAATCAGCAGG 

AAACATCCCCCCCCCCCTACTGTTCCAGATTGACCAACTATGCAACATTTTTA 

CCGTAGAAATGGCCACAGTGAGGCCCAGCCACTCAGGATGCTGGTGGGCTCT 

CCTCAGGTACCCTGTCCCAACAGTCCTGGGATTGATTAGAGACTGCTTCAGGC 

AGGATACTGGGAACTCTGTGGTGGAGACACCAAGTCCTCAGGGGTGACCATG 

TGATGCCACCAGGCCCCGCAGAGAGAAGCAGGTACCTGGGAGCAGAAGCTG 

GGCCTCAGAACAGACCAAGGAAGGCAGTCTCCTCCCCAGCGCCAGGTCTCCT 

CTTCTTGAAGGGTCAGTGGCACTCCCCAGCAGAAACAGCAAATAGAAACCAA 

AACCTGGGTCTAAAAAAAAGTATCTGCTCTTCACTCTGGGGATCCAGGCCAG 

AACACAAGCTCAAGAAAGCTCTGGGAGAAATTTGCAAGCAGGCTGGTTCAGG 

AAAGAAGACCTATGGACCCATCTGCCTTCTTCTACCCTAGGATCATGTCCGCT 

GCTTAACCCTCAGAGAGGAAGCTGGGGAGGGGTACTTTCCTGATCACCATCT 

CCCTCTCCCTCCACCCCTTCTTTCTCAGTTTCTCAGACTCCTCTTTAAGTGCTTA 

ATTTCGTTTTTCACACTCCAATCCAGGAGGGATGAGCCCCCAACCTCCACCAG 

GGCCTGAGAGAAGAGGGTGTGTGATACAGCAATCAAAGTCACACCCGGGCAC 

CCGTTGCCCAAACCACAAATGGCATGAAGGCTGAGACTCTGACAGTGCCTTC 

AC ATGTAACGGCCCTAGCGATGCCAGGGATCCCCAGAATTTCC AGC ATGTTA 

CACTGACAGCTCTTTATCCATCTGCCCTTAAGAACCTCGCTCGAATCAGCTTG 

GGCAACAGAAGCAGGAGGGTGGGAGGTGACGGGTCTGAGTTGCCCTGGTAA 

CCAG^CTTCCTTTGCAGCTTTTAAACATCCTATATTTTGTTTAAGATCTGGTCT 

GGAGGCTGCTCTGAGATGTCCCAGGGCAGGAGAACCAGGTGGCACACCTGTT 

CCCCATCCTTCCCTCATTGATAACATGACCCTGGTGACAATCCCTGATAGTCA 

CAGAGGCTAAGGGGCAGCTGGTGGGTCACCTTGCTTTCCTCAGTCTGTGTGCT 

TACCTGTAAAACAGCTGAGCATGCAACCATGCCTGACTCTGGCTTCTCTAATG 
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GTACGGGAATACTGTCTGTGAGCTGAGGGGCAAAGGGGAGTGAGGCATGCTG 
TTCAGCCGAGCCCCAAAGGCTGCTAATGTGGGTGTCTGCCAAGCCACCACTTA 
GAGCCGGGTTTTATGAAAGAGGTGGGATGAGGACGTtAACTCTAGAAAAGTC 
TCTGGGCTTTATTCATTAACTGACAGTCCTGAATGGGGGGTGCACGGGGGGG 
5 GGGGGCTAAGGATCTGTGACTGTCAATCAGAAGAGAGGCCAAACTGGGGTGG 
GGGTAGGCAAGGCATCTGGAAATGGGGACTTGGCCACTCACCCCATTGTCCC 
AGGGGCAATCAGAGCTGAGTGGGAAGGAATGCCAGGTACTGAGATGGCTTCC 
TCCAGCAAAGTGACCGTCTACCCTGCTACTCTAGGGAGAGGTAGAGGGAGTT 
CTGGAGTTCTAGTTCCAGTCAGTGAATGCACAGGCCCTGGGTCCATCGAATGG 
1 0 TAG AATCTCCAGGGAAAAGACAGACCCGTGAGAGAGCACGGAGGACAGTTC 
AATCATGGGGAGCAAAAGGAGGACACAGAAGCTGAAGCAGCTAGCGCCTTC 
TGACCC.TGCTGAGATGAAAGCAGAATTCAAAGGTCGTTCAAGATAGTACAGT 
GATAACCCAATGGAAGTTCTGGGGTTAGACCACAAAGAGCCCAGGGTACTCT 
CCTGATGGTGCTAAGTTTCATGGGGGTGTAAATGCCAAGCACAGCTACAGAA 
1 5 AGCTACTTACCCCCCTCCACTCCACGGTCCCTGGATCAGTGGCTCTGCTC AAG 
TATATTCCCCAGGCCCAAGAAGGGTATAGGGGTGTCCCAGTATTAGCAGGAA 
TTAG<3AATACACCTTGTTTACAGAACCATAGGTCAGAATGATCTGCAAAATCT 
GGGCAACACTCATTTAAAAAATAAACGAAAAAAAAGAAATAGAAAGAAAGA 
AAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAGGAGGGAGG 
. 20 GAGAGTGTGTGAGTCCAATGTCCATGGGAATATGCAGTAGTGCCATTAGATG 
GAGGTCCAGGAAAGGGTAAAGATGGCCTGAATCTCTATCTGTGTCATATCGG 
AGCTAGGGTGTCACAAATCCATCTCTATCTCCTCCACCCTTATGGTCATTTGTA 
AACAGGGGAAGTAATACACAAAGGCTGCTAAGTGTGATGTCACTGTATTTAG 
CTCTTGACAGAGCTGAGGAAGTGATCCACAAAGCAGAACCTGGCATTTCTTC 
25 CCGAATCCACTCAGGGGCTCTAAGATACCTCTGCCCAAAGCTTCCTTTCTCCA 
GGTGGCCCTGCCACCTGCTCCTGCAGCCTAAAAGACCTTCTGTTTAGAGGTGG 
GGGTGGGACGCACCCAGGTGGCATGTCCAGGTTCCTGGAGCTCCTTCCCTGGC 
ACAAAAGGATGCATTCATGACTCTATACAGGATTGTGTTGCTTCCTCCTTCTT 
AAGGGCTAATCCATAACAGACTCAAACTTTAACAAGTCGAGGAGCTGCCAAG 
30 CATGCTGGGTCCCTGCCCAGCTGCTCTTGTGGTTTCCCAGAAGCGACTGAAAG 
GGTACAGGCTTGGGACAGCATTCTACCCTGTGCTAGCTGCCCACTGCCCAGCC 
" TTGGCCAGACTCAGATAGGGGCTTCTCTGAAGCATTCCTATTGCTACCCATAT 
GAAGTTCTCTAGGAAGGGACAGGGCTCCTGAAACCAGCTTGGTAGGTATGAA 
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TGGTCACCACAAAGCACCCCAAAATGCTCCTGGAGGCTGQCTGTAACAGCAG 
CTTGTTCCTTTGAGTTACCATTTGGGCCGCAAGAGAAGAACCAAAGGCAGAG 
GCAGACAGGGTTGGGGCTTACTCAAAGCCTCCTGAGTACATGATGCAAGCTA 
GAAGGTGCACCACAAAGTGCTCACCCACCTAAGCAATTCGGAGGTGCTGAGG 
5 GGCCCAAGAACCTCTCCTGTGATGGCAGAGGTTGTGAAGACCCTGGGCAGCT 
GGGATAAGTCAGAGTAAAGAGGCTCAGACTGGTTTTAGGGGCATTGGTTAGG 
TAGAGGCTGACTTAGGTGCTGTCCTTCTAGCTGGGGTCCTGGGCCTGGTCTGG 
GGTATAGAGGATGGGGACAAGGCCAGATTCCTCTCTGAGAGACAATGGATCT 
TGCTCAGGCCTTCCTAGGAGAGGTAAAGACTTACAGTCTACCAAAGGAAGCT 
10 CAGGAAGGAAGCTGGAGAGAGGGGCAGGAATCCTCGCCTTGGGCAGGAAGC 
ATGGTTCCTTTCACAAGGCCCCTCGTCTCAGGAAGCCACTGTCATAGCATCTA 
AAAATAGGCAGCAAAGCAAATGCCTAGCTCTGGGGCCCAACTGACAGGCTAG 
GGGCAGTGGTGTGAATGGCCCAGGCATGGCCATGGGAGAATATACTGCTGTC 
TTGAGGTTCAAGAGATTGGGCTGGGCCACCCAGCTGGGCCCCTTCCAGGGCC 
15 AGACTACATGGCTCCTTCTCAGGCAAGGAAACTAGAAGGCAAGGACCTTCCC 
CATCTATCCCGCACTTGCCCTCCTCTAGGAAGGCCAATGTCTCGTGAGACCCC 
ATAGGGAGACTGTGTGGACCCAGCCCGCCACAGCAGTCTGCCAGAGATAAGT 
GCTGCAGGGAGTCCTGGTACCCAGAAAAAGGAACAAAGGGAGGCAGGGGAC 
TGAGAGTGGGTAGAAGAGCTCTATGTAGTTCTGGCATCACCACCAGCAGCAC 
20 CAAAGGACTTCTATCTGTTCCAGGCTGGGCATAGCCCTCTGTCCCAAGGTCAG 
GCTGTGGGGCTGGCAGGGTCTTACAGGCCACTTTGACTATGGAACATGGTGG 
CAGAGTGGAGTGTAGTTCTAGCCGATCCTACTACTTCCGAACGTGGTAGCTAG 
TTGCTTGGAAACCTCAGTCCCCTTAtCACCCGCAACGGGTCTGATAACAGCCC 
AACGGAGTGTTACTTTCGGTGAGCTGAATGAGCTGTTATCACGGACTCCAGA 
25 GTACAGGAGAAGACTGTTTCCAGGGTCAAAGGCCATGTCATGCCCCAACCAT 
GAGGATTTTTAGCCATGAAAAGTAATGAGAGTCCAGGCAATGGTGGCACAGG 
CCTTTAATCTCAGCCCTTGGGAGGCAGAGGGAGGCGGATCTCTGAGTTTGAG 
CCCAGCCTGGTTTACAGAGTGAGTTCAAGGATGGCTAGAAAAGCTCTGCCTT 
GAAAACAGACAAACAAACAAAAAAAGTTGTGAG/i AGGTGAGTTAACACTAG 
30 TGGGTTGGTGGCAGTGACAATATGTGCCTGTCACTCCACAGTATGTGCCTGCC 
ACACTCCACAGTATGTGTTTGTCACACTCCACAAGCCCTTGCAACACAAGGTC 
CTACTCTAGAGCTCAGGGATGGGCTTCAGGAGGACAGGTGGCCTGGGCTTCC 
TCATTACTGCTCCCCGTCACACGGGACATCTTTAAACACCATTGGAACCCCTC 
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ACTGTTTAGCTGCAGTACAGAGCCAACATTGATGCTTGACCATGGGTAAAAG 
CACAGCTAAATGAGTCATAGGATGCCACAGACCCACCCTGAGAAAGTTCTGA 
AGACCCAAGCTGACATCATGTATGGAGTGATCCGAGAGATAGTGCAAGGAAA 
CACAGGACACTCGGGGGCCCTGTTACTCAGCAGCTTCTGCCCAAGCTGCCTGG 
. 5 CAACCAGGCTCTTGGCTTCGGTTACCAACTCAGTGCTATGGGGGTATAAAACT 
ATGGGTGCCCTGGTTCTAATGCACTCCCCCCCACACACACTCCAGACTTCATA 
TGAAGGAAGGCAGGTGCCCTCTCCCAACATCCCCTTGCAGTTACTTTGGTAGC 
CATTCCAGGATGCCCACCACGAGCTAGCTGCTCACTGGCCAACCCCTTGTGAA 
CAGTTCCCTCAAAGGTTGAGCCCACTCAGAAGACCCCAAATAGTTTTGAATG 
10 GCATAGCCTATGCCCTGCAGCTAGCCAGGGAGAGTAGCTTAGACTGGATCTA 
GAAGTGATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGTGTGTGTGTGTGTGTG 
TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGAGGGAGGTACTAGGGGT 
CACAGTCAAGTCACTGGCATGCCAGGTCAGAGTACTGAACAGTTGAAGGCTA 
CTGAAGGCCAGTTCTCAGGACACCTGGTTCTCTAGCAACAGTCCTAGGAACA 
1 5 AAATGGCCCTACCTATAGGGTCTC AGAGTCACGCAACACTCACACTCAGAGT 
CACCCAACACTCACACCGTGTCCTACCTGGTTCCCTGGCATCAGGGCCAGAGC 
CACAGGTACCTTGACTTCTCCATAACTCTGCCCCACCCTACGCCCTCTCTTGTG 
TACAAGGATGATTGCTCTCTCCCTAGGGCAGCTGGACTCTAAGGCTAGAGGTT 
TAATAACTGAGGTTCCAATCCCCTTGGCAGGACTCTGGGTTCTGGAATCAAGG 
20 TTGGCAGTTGTTGGTTCCCTCCTAGAGCATCACAGAAGCTCCGTTTGAGGTAG 
GACTGAGCTAGAAGGCAAGTGCCGAGAGAATGCCTAAAGGGTTCGGAAAGA 
ATGGGGGTGGGGGCTCTTATTATGAGGTCGCAGTGTGTGGGCACCAGAGTAG 
AGGCTGCCAGGTCTACTCCAGCTCTGTGCCCCCACTGtTCGGATGTCCTTTTTT 
TGAAGCCAGCCCAGGGATGAACCACAGACTTCCCCTTTGCAGGCCAATGATT 
25 GTCTTTCCCAAAGCCGGGCGGAGCAGACAGCCCCAGGAGCAAGGGGCAAGTC 
ACAGGAAGTGCTCATTCCACCCTGAGCCCACAGAATTAGAGCCCCTCTGCCG 
GAAGGCAGCCAGCAGACCACTATGTGACAGACAGTCACGTATGTGTGTTGCT 
GAAGCGAGCACAACAGACAGTCATGTTCAGTCTGGAGAATTAAATGCCAGTT 
GAGAGGCTGTAATCAGCCTTTTGTGGTGGGTCTGGCTGCCTGGGCTACTC. jAC 
30 GGGGGCCACCAGCAGCCTTAGGCTGGAGGAACAGCCTGTTCTGTCGGGGAAC 
ACAGTGAAGATTCTTCTCCTTAACTGACTACTCCTATGGAGACATTGTCACAC 
AAAGCTCCTACAGACCCACCCGAGGCCATAGGCTGGCACAGTGATAAACCAT 
CCCCTGCTCTCCTGCCCTATTGCCAGCCGGTGACTATTGCCAGAGGAGACTGG 
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ACCCTTAATCCCAGACCTGTCTCAGGGGATGAGTAACCACCCTGCTTCTAAGA 
GCATCTTACATGGTCTCTGCTGCTCACCGCTCATCTTTCAGGTTTGTGTCCTGC 
CCCATATATCTTGGGAGTCCAGAACTCTGCTGGCCeCAATATAGGTTCTGAAC 
TCAAGCCCAAGGAGGACTGGTGGGAGTGACAGCAGGCAGGTTAGTGTGCCCG 
5 GAATGTCCCCTCCCACAGAGAAACCACTTTAATCTTCAAACTGGAAATTTTAA 
AAAGTGTCCATCGAAACATTTACTGTCAACCAAATCTCCCTTCTTAGAGCTCT 
TAGCCCCCGCCCCCACCACGGCCCAAAGGACCTCTCAGCCCCCTCCTTAGGCC 
CTTCTGACATCATTGCTCAGCTTCCTTTGGCCCCACTCCCTTTTGGAGACTGTG 
TCCAGTGTCCATTTCCTGAGCCTTCTCTGCTCCAAGCCCCTTGCCCCACTCTGG 
10 AGTACATTAGCTAGTCTACACACAATCCTAGGAGACAGAGACCCCCAAGAAC 
TCCCACTTTAGAGGGAGGAAGTGAAGACTAAGGGCCAGAGTGGTGGGGTGG 
GCTCCCCAAAGGTTCAATTTCATGATATCAGGGAACTATCAGCCACAGTACA 
AACCTTGCCACACGAGGGACAAAAGGAGTCCCTCTGCCAGGGTGGTGTCCAC 
CAGTTGTAGCCTGGTTCTGCTCCACATCCCTATCCACTGTGGGGCCAACATGA 
15 GTAGGGAGGAGAGGTCTGTGGGATTGTGAAAGGCTACAGTTTTCACTGTTCT " 
GCAGTCAGTAGGCAGGTCCCTGGGAGCAGGGAGAGGCTTCAGAACCGTTTCC 
TGGAGCCTCAATAGAGACCCCAGGGACATGTTGGGACTCCACCCCAGCTGCC 
TGGGAAACGAGCAGCTTCACCCAGCCTCAGCACTCCAGGGAGTCTGAGGGTC 
TAGCGCTTGGATCAGGTGGGTGAGCAAAAAAAGAAAAAAAAGAAAGAAAGA 
20 AATACCCTACAGGATCCTGACATCCCACATCTGAAGAATAATGGGGGTGGGG 
TGGGGTGGATAACACACTTCTTGAGGGTGCATTCAATTCCTTGCATGGTTCAG 
AGTAGGCAAGCACTGTCCAACATGACTACCCAGTCGAGACCCCAAGGCATGC 
CCGGGCAGAGCAAGTTTAACCCCATCGCCCATGCTGCTCCCTAGAATGTGTTC 
TCTGGAGTCACCGTGACTGACAATCTTGGTCCCTGCAGCAGCGCGTGAGCATA 
25 CATAGCCCCAGCTGAGTTCAGCCCAGTTCCTGCCCATCCCCAACGTCTTATTC 
CCCTCCACCCCACCCTGACCCGCCCTGGGATCTTCAGCAGTCCTCGATGGAAA 
ACAGGTGTCCCCTGTCCAACACACTCACTCCAGGGGCCGTTGTACAAGTTTGC 
CCTCTCAGGAACAACTTrAGAGAAACATTGTCTTGGCATAACCTCATTTCTTC 
CTGGGAAGGAGGGTGGTGGGAACAGCAGTCTGTGCACTGCAGCAGGGAGGG 
30 TGGCACCACTGTCCAGAGGAGTGGCCAGCACTGCTCACTCAAGTGCCCGCAG 
GCAGGTCTGTTAAGACTCCCCATTCCCAACCCAGTTAGGAAATGGAAAGAAG 
CCCATACCACCAATGGCTACTCCCCAGCAGTGAAGACAGTCAGGACCCAGGG 
ACACTTGGAGACCCTCTTATACCCATGTCTCCTTTCCAAGATGACCAAATGGG 
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TGTGCACTTCTTTCCAAAGTGGAGAGCCCAGCTTAGATCCTGTATATAGGAAA 
AGATCCTAAGCTCCTGGTTTGTACAGATCCCAGATCTTTGCAACACCCTGGCC 
TTCACCAACTCAGAACTGCATACCCTCTTCCTCTCAGGATCTAAGGGTCAGGC 
CGGGGCTGTAATCTGTGCAGGAGACGACAGGCAAGGAGGAGGGGCAGCTGG 

5 GGGAATGGGGGTGGGGGTGGCCAAGAACTGGGGCACCGTGCGCAGCAGCAC 
GCGGCCCGGGCTGCCCTGCCTTCTAGGCCTGGAGCTCGAGCCCCCAGGGTGG 
CAATACCAAGCTGGAGGGTAAGCAAGGGTGGAGTAATAGTGGGGTTGAGGTC 
TCCGAGGCAAGTACGGAGCGTGCCCCCACCACTTCGAGGCCGCCTATCCAGG 
GGCGGTGGCTAGGATGAGCTCATGGGGGCGGGGCGCCAAGATCCCGCCCTTC 

1 0 TCGACCCCCCTCACTGCTCGCGATGCAGGATGGTCCCCAAAACAACATTCAG 
GAGGGTGCGAGTACCCCCTTCATAGCTGAGGGCTCTTGATTACCTGGCTCTAG 
CCCGCAGACTCTTGCACCAGTTCCCAGCACTCAGAGGGGACAGGCCAGCGAT 
CCCCTGATTTCCCGGCCCCAGAAGCCCCTCGAGCCGAGGCCCAGAAGCTCAG 
CCCTCGGCCTCAACCCGCTGGCGAGGGGCGCCAGGACCCCGATCGGCGCACG 

15 GAGCCCACAGGTGGCGCCGCAACTTCGCGGTTGCGCTGCGCCTTCTCAGCAG 
GGCCTAGCGCGCTGCCCAGCCTGGCTACCGTGGGGTGCGCCCTGTCCCCAGTG 
GCTGCTCACCCAGAAGACGAAATTGAAGACGAAGAGCAGGTATTTGATGCAT 
TTGGTGCAGCCCTCCACCCCCATGGCGGTGCGCGGCCTGCGGAGCTTGGATTG 
GTCCTGGAGCGGGCGGTATGGGGACGCAGTACCGGGGTCCGGGACTGGCAGA 

20 TAGCCTGGAGCGCTTGCGGGCCGGAGAACTGCGTTGCGGAGAATGAGACGTA 
GGGTAGAGAAGGGCTGGAGGCCTAGGCGGAGGCGCGAGAGTCACTATCTCTT 
TGAAGCAAGGACGCGCTCGCTCGGACCGGGCGCGCGCGCCTAGCTCCTCACT 
CTTTATAGACCTCTAGGTCCCGCCCCTCCGAGGCCCCGCCCAGAGGACGCTCC 
ACCCCTGGCCCCGCCTTCCGCCCGCGGGGTGCCCGGACCAGGGGTACTTGGG 

25 TGGGTGCTGCCACCTCAGTCCCAGAGTAAGCGTTCTGGAACTTGGTGCAGCTG 
GTTCTTGGGGACTCGGTGCACTGTGCGTGCGCAGGATGAATCTCCAAGTTTCT 
TAAGGGTTCCCAACCCTCGGGGCAGGCTTCTGGGTCGCGACTAAGAACTGGC 
CACAAGATACTGCCCCAAAGAAGCCACACTTACTTCCTCCGGTACTCGGACG 
CCTTGGACTTTCCCGGAGGTACCCTAGTGGAGGTGTCTCGACAGTAAAGAGTC 

30 TTAAAAGTTAACATGTTTTGTGTATATTGATCTCCCAGGCAGCAACTCCGTGG 
GCCTGAAGGTCTAGAATGGAATTCTTTGTAGCCTTGCTAATCCCCTATCTGAC 
AGATAGAGTTCCTGGCTGCAGTTCTGGAGGCTGCATTTGGGAAGGTCAATAG 
TGTACCCAATGTCTTTAGCCAGTCTTTAGGCTCGAGAACCCTCAGGGGCACCT 
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TCAAATATTCATGCCATCCTGGCTGTGCCTTCCAAGAGTCCATCCTCCATCCTC 
TAGCTGGTTCCCTGAAAGCTGGGATTAATAAGGAAGTGAGGAAAGATTCAGA 
GGGCAGAGGTGCTTCCTTCCCGGGTGAGGAATGGCTGCTGTCATCATAGATG 

atattcctgtgaaatcttggcattcccttccccaaactgaagggcaatggggc 
tgcttcattagactcactgggaaccatggaaggaccagttcccaagactcag 

CCTTITCCAGGTCACTCCATGTTCTTAAATCTTTGAGCAGTTAAGGACATTGTT 

CCCTATGGAGATCACACCTGGCACCCCTCTGGCTACAGGACACTGGAAATAT 

GTTTAAAAAGGAATAGATATACATATATATTTAAACAAAGGAAAGGAATATC 

TATAATATATATGTGGAGAGAGAGACAGACAGACAGAGACAGACAGACAGA 

CAGAGAATGAGAATCTCACAATGTAGCCCTGGTTCCTGGAACAGAGCTCTGT 

CTACTTTTGCTTCCTAAGTGCTGAGACTGTAGGTTTAGGACACCATGCCCAGT 

TGTCTAAAGGGCGAAAATTTGAACTATGTATAATTTACATGTAAACCTGATGA 

TTGGCCAGTAGCTGCCTGGGTGGAGCTATCTCTACTCGGCTAGGCTAGTAGGC 

TTCTTGAGAGTTAATTTGTGCATTTGTTTTGGCTTCAGGCTGGTATAGCTGCTA 

CAGTGAAGGATTGTGTCCACATTTATAGTTCTTGGAGGCTGAAGTGGTGTCAC 

AACAAAAGTAACACTAGTCACCAAATGCTGGAGTAATATCACACTAAAAGTC 

ATTTGAGTCTCCAGGAGGGAGTCTCTTAGGATTCTTTTTATTrAAAAGTGGGC . 

AAGAATAGTTCTTGCACGTGCAGAGATGGGGACCTGCTTTCTCTTCAGATCTC 

CACTGTCTCCAGTGATGGGGATACCTCATGCCTAAAGTGGGGTTACACATTGA 

ACGTCATGCAGGGCCAGCTGTTTTGTTTATGTACTGTGTGATACATCTGGGTT 

TAGTGAATGCCATTGTGTTCCTGGAACTGTTCCAAGGGCTATGGACAGCCTAG 

AAACCAAGGCTGCTTCTTTGGGGCCCTTAGGTTCTGGTC AC ATAC ATGCTATG 

GAGAGAGTTTCCATGATGTTTGACCAATCATGGTAGGGAATTTTTTTGT 

CAGTTTrCCTCCAAATTAAAACATCCCTTTCTGGAGGGAAAGTATTAAGACTC 

CGG AAATGAACAACTAACAAGTTTTAGACGTTCCTGAAACTAACAGGTCCCT 

TCCTCCCTAAAGATACATAATTAGGAAGACAGCCCATGCTTTGACCTATTCAA 

GTGCCATTCTCCAGGGTTCCAGCTTTrGTGAGCCATCACCTATGCTGAGGtTTT 

TGGTGACACAGCTCCCTTTGAGTTGTCTCTGATGCTGTAAGTAACCCCTCACC 

CATACTTTTGTAAGTAACCT AAATC AACTAATCGGTTTAC( j AAGTTGGACTTT 

GGTAGGAAGCTCTTGGATCAGGAACAGATATGGGCAGAAGGCCAGACAGGT 

CTGAGTTAGGTGGAAGGCAGACTTAGGCTGCATTCTGTTCCCATTGAAGGGGT 

TGGGGTAGGGAACAATCTCATAGGCCCAGGAGACTTGCTGAGATGACTGGTT 

GGTTGCAAGAGTACAAGGGGTGCCTTGCAGAGCCTGCACCTGTGAGCACTAC 
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ATCTTAAGTGACTTGTCCAACATCCTGCCAGCTGGCCTTATAAGGACATGGGG 
GAACCAAGTAGGTACCCAGAGGGTGCATCAGAAAATGTCCCCGGCGGTTAGC 
TCAGCCTGAAAATATCTCTTTTCTGGCTTCCAAGAAAACAGTGATGAAGCTCA 
AGCCCAGGATGGGAAGTTTCTATTTGCTTTCCCTTGTCCAGCAGGTGTGGGAA 

5 GATGGGAAGAATGGGCCTTCCTAGCCAGGGCTTGTCTCACAATTCTCCTGTCT 
GTATGAACTTGTACTGGCTTCTCAGTTTCCCCAAGTGTGACAGAAATGCTGCA 
GAAAGTCTCTAAGTACAGAAGGTCAATGTGTCCTTGTTACTTGGAGCAGGGC 
CTGGCCTGTGGG AAAGTCAAGGTCAAGCTCTTTTTGGCTCAGTCCCAGAGTCT 
TGTGACTCACTCCACAGAGGAGTCCTGCTGTGTGCATTTGGCAGGAGGTAGG 

1 0 GGG ATTTTGTTGTAG AACCATGGTTTCCTAAAATGAAGACTCCCC AACCTGGG 
CTGTTTATGTAGGGCTTGCTCTTGTAGGGTTTGTTGGCTGCCACCATCATCACT 
TCTGCTTGCTAGAAATTCATATGCAGGTCAGGGAGTAAGACAGCTGTGTATTG 
AAAGAGCTCAGCATCCAAAAACGGCCAAGCAATGCCAGACAAAACTCTTGTG 
AGCATCTCTAGAGTCACAGGAAATGCTGTCCTCTATTCTCCAAGCCCCACAAC 

1 5 TCTCCTGATAAAATACTTCATTCAGCAGCAATAGAAGGGGCTCTGTCCTCCAG 
ACCTGAGGACGAACTGAAGATCCATAAAAGTCTCGAGCCTGTTCTTCTCAAG 
GAAGACTGAAGGCTCCTCACCCACCAAGGTCAGAGACGTAACAAAGTGAAGC 
AAACAAGGCAGTCTTGGCATTTCTCCCCAGGGTTCTAGGAAGGATGATCCAG 
CAAGATGGAAAGGTGCCATGCAGGACAGGGTGCTGGCAAATGTTTAAAATCT 

20 CCTGTGGGACATACCTACAGGCATTCACACTAAGTGTGAGTGACACCATCCC 
ATAAACTTGGGTTGTAAACTGAATAAAAATCAGAAAGTGAGATGAACATCAG 
TGTTTATCTCTGCTTCCTGATTGTGGGCGTTAAGTGACAAGCTGCATCACACT 
GCTGTAGCCACACCTTCCCTGTCATTATGAACTATACCCTTGAACTGTGAGCC 
TCAACAAATCTGTTCTTCTTTATGTCACTTTGTCAGGTATTTTTGGCACAGAAA 

25 CAAGACAAGACATTGAACAGTTGCCTGAGTTAAGGTGTTTAACAACGTGGCA 
GATATAAAATCAATATGAAAAAGACAAATATTTCTATAGACTAGTAGTAAAT 
AATCCAAAATGGAGTCTAAATGATGATCTCATTTAGAATAGCAGCAGAAAGA 
ACGAAACATCAAGGGATAAAGTCACTAAAGATGTGTGAGACTTGCAGACTTG 
'■AAGAAACCAAGTATCTAACTATATTTGAGAGGATCTGACTTTCACAAATCA 

30 AAGACAAAATTATCATTCAGGTCATAACACTTCAGAAGTTGTCTGGCCTTTTT 
TTTTTCTGATTCTAAATTCAAGGTTATTTTCTTTTTAATTAAAAAACATTATTT 
' GTTTATTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCCCTCCCTCCCTCTCTC 
TCTCTCTCTCTCTCTCTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGCACATG 
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TACACGTACATGCTATAGCATGTGGCATGTGTGTGGAGGTCAGAGGACAGCT 
AGATTCTCTGCTTCCACCATGTGGGTGCTAGGGATTGACCTTAAGGAATCAAG 
CTTGGCGTCAAGCTCCTTTACCCACTGAACTATCTTGCAAGACCTGACCeTAA 
ATTCACAGGGAAAGGTAAAGGACTAAGGATAGCCAAAACAATTTTGGAAAA 

5 GAACAAAGATGGAGGGCTTATACTTCTTGATTCACAGTACAAAGCTAGAGTA 
GTCAGGAGAGTATTGTTCTAAAGTCTAGTATGGTAAAATATGGGTGTACTCCC 
AGCATGCAAAGGGTGAAGGCAGGAGAGTTGGGAGTTCAAAGCCAGCCTTGA 
CTATATGAGACATTGTCTTCAAAAGGGGAGGTGGGGGAGACAGAGGGATAGC 
ATAAGGATAGCTATTTTAGTCGGCTTTCTACTACTGTGATAAATACCAGAACC 

10 AAAAGCAATTTAGGGTGTGTTTTGGTTAGGGTTGCTATTGCTGTGATGAAACC 
CTATGACCGAAGCAACTTAGGGAGGAAGGTGCTTATTTGGCTTATACTTCCAG 
ATAACTGTGAATCACCAAATGAGGTTGGGACAGGAATCAAAACAGGGCAGG 
AGCCTGGAGGCCCTGCTGATGCAGAGGCCATGGAGGGTGCTGTTCTCTGGCTT 
GTTCCTCATAGCTTGCTCAGCCTGCCTTCTTATAGAAACCCAGGGCTGGCATC 

1 5 ACCCACATTTGGCTGAGCCCTCCCCCATCAGTCACTAATTAAGAAAATGCCTC 
ATAGGCTTGCCTATCTTATGGAAACCTAATCTTATGGAGGCATTTTCTTTTCTT 
TTTTTTTCTTTTTCTTTTATTTTAAAATTGA 

AGCTTATGTCAGTTGACATAAAACTGTCCAGCACACAGAGGAAAGAGTTTAT . 
TTGGCTTATGCATGCCAGCCATAGTCCATCATTGTGGAAAGCCAAGGCAGGA 

20 ACGGAAGCAGAGGCTATGGAGGGATGCTGCTTACTGACTTGCTCCTCAAAGC 
TTCCTTAATTTGCTATTTTATGTAGCCCAGGACCACTCACTCTGGGGTGGCAC 
CAACCACTATCTGAGCCCTCCCATATCCATAATTAAAAAAGAAAATGTTCCAA 
AGACCTGCCTACAGGCCAATAGGTTCCCTCTTCCCAGGTGACTCTACCTTGTG 
TCAGGTTGACAAAAAGCAAAACTACCAACAACAAACAAAACAAAAAACTAA 

25 CACAATAACAACAGAGATCAATGGAATAAATAGAATTGAAATTCTAGGAATA 
GATCTTTCATTTTAAATGAGAGGACAAGACAGTTCAATGGGGGGAAGGAATG 
GTCTTTTTAAACATGGTTCTAGTATTGCTATCCTTGTACACAGAAAAACGAAG 
ACAAATCTATCCAGTGTAATACCTGCCAACATTATGATGAAAATGAGACCTA 
GGGGATAGAGCCATG/ CTCAATGACTAAGAGCACTGGCTGTTCTTCAAGGGG 

30 ACCTGGATTCATTTCCCAGCACTCACATGGCAGCTCACAACCACCTGTAACTC 
CAGTTCCAGGAGATCTGAAGCTCTCATATGGCTTCCATTGGCACTGGTGTGCA 
GACATGTATGCAGACAAAACACCTATATACATAAAATAGTTAAAAATTAATT 
AGGCAAAAATAAATCTTAAAGAATGAGCCCTAGATCTGAAGGTAAGAGCTAA 
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ACCTTTAGCTGGGCATGGTGACACACACCTTCAGTCCAGCACTTGGGAGGCA 
GAGGCAGGCAGAGCTCTTGAGTTTGAAGCCAGCATTGTCCACAGAGCAAGTT 
TCAGGACATCCAAGATGAATTTTTTACAAAACAAAGAAAGGAAGGAAGGAA 
GGAAGGATGGATGGATAGATGCATGCATTGGTGATCTTGCCTTAAAATAGCA. 

5 ATTTCCTATCTATGATACCAAGCACACAAGTGCATGGACTTGGACCCAGGTTG 
CATGAAAGAAGAGAGGGAGATGACTGAGCGCCAGCATTCATCTGTGTGCTAA 
CTGTGAACGTGATGTGACCCGCTGCCTCATCTCCCTGCTGCCTCGCCTTGTCTG 
CAGGAGCCTCAATCAGGAGCCAAAATAACTTCCTGAAGTTAGTTTTGACAAC 
TATTTTGTCACGATGAGAAAGCTAACTCAATATTTGTACTGAAACAGGAAGTC 

10 CCAACATTGTATTGGCAGGAGATTGGCTCTGTTTGTTGAGAAAAGGGGAATT 
GAAAGACGTATTTGTAAATATGTCATGCTATCAGAACACAGCCAAGCACCAG 
CCTAAAGGATGCCTCTTAATGGGCTGCCTCGGAGGAATGAATATCCCTGCCCC 
TCTCTTTGGTTATCAGTCTGTTGAGACAATGGGCCCAAATAAATTAGATTTTC 
CCCAGAAATCTGGGGCAGCGGGGGTGGGGGGGGCGGGGGGAGAAAAGGTGT 

15 AAGTGCTGCAACAACTGGGAGAGGATTTTGTCTCCTGAAGAAAGATTTACAA 
CTTGTCCAAGGACACTGTAGGAAAGACTTGGGAGCTGAAATAACCGTGGGCC 
AAGGAGCCATACTTTGATCAGGGCTGCACAGTTATGCATACCTGACCTCTCGA 
CCTTAAGCTCATGTCAGCGTCTAGAAGATGGACTTGGGGCAGAGGATTGCTG 
GATCCCACTATCCTTCTTCCCAATGTAGCCATGGTGGATAAATCGCTCCCTTTC 

20 TGTTTATTAATTTGTCTCCTTTTGAAAAGGTGTATTTCCATTCCATTTATGTAT 
AAATGTGTCTGTGTGAGTGTATGCTCTCTGTCTGTCTGTTTCTGTGCTTCTGTC 
TCTCTCTGTCTCTGTCTCTCTCTGTTTCTGTCTCTCTGTCTGTGTGTGTGTGTGT 
GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGGTGTGGATGTGTGGGTGC 
CTATAGAGACCCGTAAGGGGTGTAGAAGCCACTGGAGTTGGAATTGCAAGTT 

25 GTTGTGAGCTGTCTGACATGGGTCCTCTACAAAAGCAGCCGCTCTCTTGACCA 
CTGAGCCATCTCTCTTGCGCCTTGACTTGGCATTTTTAACGAGTTTAAGGATG 
AGCAGCCCAATCTGGCTTGTTGGGACTCAGGCAGTGACTCTAAATCCAGTAAT 
ATCTCTATGCATGCAAGTACCCAGATACCCAAAGTGTTTCACTTCrCTACTTG 
AAGAGTAAATAGAGACCAAAATATGAGCTGAACTTGCAATAACCACGGTTCC 

30 CACACTCTCTGTAATTCTGTATGTGTGTGATATACTTTCATGAAAATACTCATT 
TCTAAAAGATACCTGTGACAACAAATAGCCATCTTTGCACTGTCATGAGAGG 
AAAGTCTTATCGTCATAAAACTTCTAAATTTCCCCAAAGGAATCTGATTTTGT 
CTTGAGGTGGGGTTGATAATCAAGACATTTGGCTCAAGGATTTACCCATATGT 
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TGAAACATGTGTATGTTCAACCAAGAGCAATGAAAACACATCCATACAAGAA 
CAGATCCACATACTTTTTGTATGTGCATTTGTTATCATAGCCAAAAAGTAGAG 
CCAGGGCTGGTGAGAtGGCCCAGAAGTTCAGAGAGCTTGCTTCTCAAACACA 
GGAACCTGAGTGTGGGTTCTTATAATGTTCTGTAGTTTCAGCTCAGAGGGATC 
5 TAATGCCCTCTTCTGGCCTTGACAGGCACTGCAAGCATATGATGCACATACAT 
GCAGGCAAGGCATCCATCCATACACTTTGTAATGACTAACTTTGGCTGTTAAC 
TTGACTATATCTGGAATCAACTAAAACTCAAGCAGCTGAGCATTCCTTTGAGG 
GATTTACTTGGTTGCTTCAGTTAGGAGGGGAGACCTAACTCTGGGTCATACCT 
CCTGGTAGCAGGTCATATAAAAGAACATGGAAGAAGGAAACTTGCTTCCTCC 
10 CCTGCCCCCATTTTCCCTCACTCTTATTGGCTTGTTCATGTGTTTTGTTGGTGA 
GTCATTCCTTTGCTGGTGTTAGGATCTACITITTTTTTT 

CATAGACCAAAGACCAGCATCTCTTCAGGAATCCTCTAGGACTCCAGCACTA 

GATGGTACTGCTGAGACACGCAGTCTTGGGGACTGAACAACTGCCAGATTCTT 

GGCCTTCCCACTGGAATACTGCTATTGTTGGAATACCAGGCTACAGCCGGTAA 

15 GTCATTCTAATACTCTTTTAATATATTTGATATATATTCATTTTATCAGTTCTGT 
TCCTTTAGATAACCCAAGGCAAACATAAAAGTAAAATAATATATTTTTAAGTA 
CAAAAACTAAAGATGCCCATCAAGTGACAAATGTGTCCACATGACATTGTCC 
AGACCACAGAGTGTAATGCAGCCATTGAAATGAAGGAAGCCATGATACATGG 
TATAACTCAGACGGGTCTTGAAAATGCTGAGTCAGGTATTGTGACATATGAGT 

20 GTAACCTGGAAGACAGAGACAGGAGGATCTCCACAGGTTCCAGGCCAACCAG 
GCCTATAAAGTAAGACCCTGTCTCAAAACAAAGACAAAACAAAACAAAACCC 
TTAAATGTTTTGTTGATGAAAAAGGCCAGGTGTAAAAGACCTTATATTTGATG 
CTTCCTTTTAAAATGCTGAGAAAAGTCAAATGTATGCATTCCTTCGATTCCAG 
CACTTAGGAAGCAGAGGCAGACAGGTTTCTGTGAGCTCAAGGCCAGCCAGCA 

25 TTGTCTACACAGAGAGTTTCAGAATAGCAAGACTTCATAAAGAGACCCTATCT 
AGGCAGGCAGTGGAGGTGCACACCTTTAATCCCAGCACTTGGGAGGCAGAGA 
CAGGTGGATTTCTGAGTTCAAGGCCAGTCTGGTCTACAAAGTGAGTTCCAGG 
ACAGCCAGGGCTATACAGAGAAACCCTGTCTCAAAAGAAGAAGAAGAAGAA 
GGAGGAGGAGGAGGAGGAAGAAGAAGAAAGAAGGAAGAAGAAGi.AGAAAA 

30 GAAGAAGAAGAGGAGGAGGAGGAAGAGGAGGAGGAGGAGCAGCAGCAGCA 
GCAGCAGCAGCAGCTGAGAGTATGTAAAGTAATAGTGGTAGAATAATAGAGG 
TAGAAGGGAGGCTTGCCCTTGTCTGGAGCTGTGAAGGTGGTGCTAATGCTAC 
CCTGAGAGGTCTTATCACAACACATCTTTGGGGTTTTGGCTGCATACATCTGG 



WO 02/054069 



94 



PCTAJS01/49808 



TTTCTGCCTCAATGATAGGCTGTGAACTCGAAGTATACGCCAAGTTAACAATT 
TCTTCCCCTAAGTTTGCCTGTTGATCATAATGTTTTATCACAGCAACAGAAAA 
CAAACTAGAACAGGGAGGCCTCTGTCATTTCTGGTCATCACATTTAGCAAACT 
AGGAGAGCTCTGGGGTGAGAGGACCTGTCAGCAAGCTAGGGAATTTGGGAG 

5 GAGCCAAAGGGGATGAGGAAAAAGTAATCACTACCCATCAGTGGTTAACTGA 
ATGACTAAGCAGAGACGTAACTGAAGTGAAGCCTATGTCTGACACTTGGTCA 
GCGTGTGGACAAAAACCCCAAACCAGACAGCATAGGAAGTGAGAGAGAGAG 
AGAGAGAGTAAAGGCTAGAGTCTGAGCCTGGCCCTGGGGACCTGGAGGGTCA 
AGCTCTGGACTGTACTTTCAAGTGTTCTCTCTGGTGGAAGGGGAACCAGCAGG 

10 GAGTCAGACAAACCCACAAGGGCAGTAATACAGGTCCAGGAAGAGCCACCA 
CAGTGACTGGCATATGGGCAGAGATAGTAAAAAGGTCACCGGAAGACAGAC 
CnTCCTGCTTCAGGGACAATCAACAGCCCTAACTTGTGAGGTTTTTCAAAAGA 
CTCAAGTCTCCATAGTCAAGGAAGAAAAAGGAGCTCAAAGCAGGCCTGTGTA 
TGAGAGACTACAAGTACCAAGGGCTTCAGCGCTTGCCTGTGGCCTGCAGTTG 

15 CCTATGGAATGGTTTGCAGTGAGGTGCTGTCCTGGTATTGGGAGGCCTAGACT 
TGCTGTGCCCAGGAAAGTACCTTGGCATCTTCTGGAATCCTTGTGATGCCAGA 
TCCAGTCAACTCTCAGGGAGCTTGTGGAGGTCACCTATGAGCCAGGATGAGT 
CTTTCCAGGATGTGCAGGGACTAAAGTCCCCCTTTAGTCCAGCTGTTGGAGGA 
AGAGGGCAACAACAATTCTGGATACCTTGGGGGTTGGGTGGACACTGCAGCT 

20 GTGAAGGAAAAAAAAGGCCTGTCTGGCCTCTGGAGAGCTGTGACCCTGACCG 
TAACTGGCAGGACATTGCAGCTGCCCCTACTGTGGGACCCTTGCCCTTTGGAT 
GGTGCA'GTCCCAAGGTTGGCCAAGTGCACACAGTTCTCTGGGTCAGAGAGGG 
CTGGCTAGAGTTCTATATTATTTCCACCTCCCGACTCCTGGCAAGCAATGGAA 
TCCGGAAATCTGGGCTGAGCCAGAAAAGGAGGCCCTGTCCCTGAGGAGAGGG 

25 TCAGATAGTCCACTGTCTGGGCTGGGGGTGGAGCTTCAGCCTACAAGGCTAG 
TCCTGTCTTCATGGTCTCTGTGTCACTCTGTGCAAAGACCCCAGCAGTGCTGC 
TAATAAGAGGCCAAGGCTGTCTTCACCATCTACCTAGGAGTGCCTGCTCAGCA 
CTGCGAGTGTAGCCTGTTAAGACCCATCCCCTGAGGAGAGAGGGGTTCTGGG 

aagtga' : :cagggctggctgggatgctcttggtgccttcattcctgtggaatat 
30 aaacctagggaggcccaacagcctctgtctgtctcccacctctgtcccatcac 
catggattagaggggctagagggggctgaagggaagagtctgaagaaatca 

TCTCTGGCTTCTGCAGTGACTAGGCTCACACAGGGCTGATGCCCACCTATCCC 

agacatttctggtcagagccaggctgccccaaacatagcctttgtcccccttt 
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CCCCAGACTAGATTCTTGAGAGCACAAAGGGCACATTAAGACATCAGGACAT 
TTGCTTTGTCGCAGGGACACTTCCTCATCTATTTCTCTTACTGTTTAGACTATC 
AGGGGCTGCCTGCCTTTGACACATTGCCTGCCAACTTCCATGTGACGAGGCCC 
ACAGGGGAGGCCAACTTCTCTGAAGGGAATTCTGAAGTTCTCACAGAGAGGC 
5 TGCTAGATGAGACCTGAAGAAGTGGGCAATGCTTAGGAAAAGAAGGCCTCCA 
GCCTAGACCCTGATAGGTTAAGGGGGTCACTGGGCCTGGGTGGGGAGACCAC 
AGCAGGATGGAGTAGACTGTAGGAGGGTGGGACTTCTCAACAGAGAACCTAA 
AGTTTGGATTGAAGGCAAGTCTTTCAAAGCAAGGAAGTAGCTGGGAATACTC 
AGGGGAAAGCCAGGAAGGGAGGATGTGGTGAGAGAAAGTTCTGGAATCTAA 
1 0 CTATTTCTGCTTCCTATTTCCTC AAGCGTGAGTGGATGGCCAAGCCAGCTTGT 
GTAAGGCTGTGGGAGAAGGAAGTGGGAAGGTGCTCACAAGTGAGTCAGTGG 
CTGGGGGGATGGAGCTAGGTCTCAGGACTGATGGGTGAGTGGGTGGGGTATG 
GCTGGGGTCAAGACATAACATTACGGAAAGTGAATAACCATAAGCGCTGCTG 
GCTATTAATCCTTTGCAGTGGAACTGGGCTCTGGTTTCCAGATCAATACGTAG 
15 TAATTAAGAGAGAAAAATTATTAAATAAGACTCAGAGGCCTTAAATTCTTGG 
CAGATAAGAGAGTCAGAGGACTTCACAAAGGACATTGCCCCACAGGATGGG 
GACCGGGGAGCTGTGATGAAGACAGGTCTGAGTGAAGATGTTTGGAGTCCAG 
GCAGGCAAGAGGATGCTGAATGGGTTGAGGGCCTCTTAACAGGGCACTGCTT 
GGAGAGGGGTAAGAAGGAAGGATGTGCAGGTTCAGAGGCCACTAGAGAGGG 
20 GCAGGGGCAGGGGATGAGCCAGGAAGGGGTAGGATAGGGCTCCAGCAAGGT 
GTCCGCCTCTGGCCTTATCCGCCTCCTCTAGCCTGCCCAGGGTCTGGGAATGC 
TTGCTATCCCAAGTTTAAGGCAAGACCCACTGTGCACAAGGCTAATCATAGC 
AGGGCCAGCCAGTGCTGTGGATCAGCATCTGCAGGACACATGACTTCTGAGT 
CCCAGTGTCACTGCTGAGTGGAGGTGCAAGGTGCAGATGCAGCACCTAGTGG 
25 GGCAGACTATTCTCAATATCTGTCTCTGACTCAGTGCCCAGATTGCACATGTA 
TTGAGATAGATAGAGCCCACGATGAGGATGGAACTATAGGCCTGCAAGTCTC 
TGTGGGCATCTGTTTTTCTAGCCATATCAGGCTTCTGGCTCCTCTTGCTCTATC 
TAGGAAATCCAGGGTTTATGTTTTCCTGGCCTGCAAGGAGGCTCCTTCCCACA 
ATAGTGTCCAGGTTTTAACCCCAGGCTTTGGGTCCCTAGTCTGGCAGGCCTGC 
30 AGCATCTGCCTAGGGCAGGCTCCTTGTAAACCATAAATGCCCAAGGGCCAGG 
GACAGCTCCACCCCACTGTTGAGCAAAGGAGTAGATCAGCCTTTCAAGAGAG 
CTTCAATGTCCTGAGGCCATCTTTGGAGCTGATTCATCCCTTTAGAGACACTG 
GGAACTATGATGAAGATGGTGTCCTTTTGTCCCTCCATCAATCACTGCAAGGA 
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CACACACAGGTGGCCCTAGAAGGAACAAGAACTCAGGTGAGACAACTCCTAT 
GAGCTGGAGCATCTCTAGTGAGGAGAGCCTTCTCCATTTGACTACAGCTCCAC 
CCCAAGACACAGAGAAAGATGATAGGGACTTGTTAGGGAGATTAGCTAATCT 
CCCTAATATGGCTGTGGACACAGAAGTCTCATGACATCTGAAGACTGGGGAA 
5 ATAGCTAGTGAAGTCTAAATTTGCCTAGAACCAGGGGAGCCAATGAATTAAT 
ACACAACCCAAGACTGAGGGCCTAAAGCCAGGGGTCAGGGCTATCAGTGAA 
GGGCCTAGATCCAAAGGCTCTCCCAGAGAGCCTGATTTCATTAGAGAGGTCTT 
ATGTCCATGGGCAAAAGAAATCTACCTGGAGGAGGGGCCTAGGGGAGGGGA 
GGACCTCACTGTTTTCCATGTGGACCCTTAACTCAGTTCAGATGCTGCTGCCC 
10 ACATTGGGCGAGGTAGGTCTTCCCCACAACCACTGATTCAAGTTTCAATATCT 
TCCAGAAAAACTTCCAGAAACCTATTCAACAGCAGAGCTTTCTCACCTATCTG 
GACAGCCCATAACCAGAGTCAGGCTGACACCCTAAACTTCCAAGTCCACCCA 
TGTCAACATGACTAGCAACATCCCCCTAAATCTCCCTTTATCTCCCAGTAAAG 
ACTGCAAGGCCATGGTTGTACACAACATGAGCTAGCCATGTGGTGTATTCCA 
15 GGTATCCCCTGAACCCTGCACCAAAGGAGAATGTGTGATTCCTGGTGATGTTT 
GCTGTCCCCCTGACATCCATAACTCAAAGACTATGGGTCATGTGATTGAGGAA 
TAAGCTAGGGAAGATACATGAAGAAAACAGGTACTGTGTCCTTCTATGTGAC 
TACCTACCCACAGATACCCTTTTTTTAAACAAAAAGATTAATTTTCTG 
CK3GGTGCACAGTTTTAATCCCAGAACTCAGGAGGCAGAAGCAGGTGGATCTT 
20 TGTGAATTCAAGGCCAGCTGGGTCTACATAGAAAGTTCTGAGCCAGCCAGAG 
CTATACAGCAAGAAACACTGAGAATTTTAAAACTTTTCATTTTTTGTGTAT 
GCGTTTTGCCTGCATATGTGTGTGTGTGTGTGTGTGTGTGTGTAGTGCCCATG 
GAGGCCAGAACTAGAGTTATAGGCAATTGTGAGCTGGCATGCAGGTACTGTG 
AACCAAATCCAGGACCTCTGCATATATACCAGTGCTTTTAACTGCTGAGCCAT 
25 GGTTCCAGACACATTCTTAAAAGAGGAAGTGCTGTGGCCACCAGAATCTTTCT 
CTGGTCACATGATCTTAGCTAGCAGTCAGAAATCCCCCTTTCCACACCCCACC 
CCTATTCTGCAGCCCATTTGCCCTCAGCAACTCAGCTAGTCTTGGCTCTTATCT 
GACAAGTAACCTATACCTTCCCCTGGCCTGAGAGCCCTTATGGTAGCATTGAC 
TTTAACCCCATGGCGTGGTAACACTACGGACACAT^AAGGGAGCCATGAACT 
30 GCAGATCTCTCCTGGTAGTCAGGGTCAGGCATTTCACCAACCCCCAATTCCTC 
CTCAGCCTCAGAAGCTCAGATTGCGGTGTAGATTCCTTTACATCCAATTCAAT 
' GTAGTGTTTGTTGTATCTCCTTGCAGAAGTGTTCCTCATTCTGGAGCCAAAATT 
TCTAGGCCAGCAGAGCATAACACTGTGCAAAAAGGAAGTAGAAATTTTGCTT 
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AATGGGTCCTTGGGGGTTGTGGGTAGTAGTTCTATTCCCATGCCCACCCCTTG 
" ATTTCTGGATTCATGACTCATGACTATGGAAGAAATAGTGCTGGACACTGAAC 
ACTGAGGCAGAGAACATTCAGCCTTCTGGAGAATTCTGCTTCAGCTCTTATTA 
GCCACTCAGTACCTTATTTTCAAGTCAGCTGCTTCAGAGTGATGGAGACACAG 
5 ATTTGTGAGTCACATGGATGTAGCCTTCTGCCGTGCTTTAATTTGTTGGCAAG 
TGTATTCCTTTCTCAGAGACACAGCAGTATCACTGAGGTTGAAGTGTCTGAAC 
ACAAACACAGTGGTATAAAGATGGCCTTTGGGGAGGTGACTCATTTAGGAGG 
GCAGAGTCCTGACAAATGGGCTCCATGTTCTTATAGAAGGAGTGAGGGATGG 
ATCCACTCTTCCCTCCTTGCTAGGGTACAGCAGGGAGGCACCATCTAAGAGG 

1 0 AACAGGTCTTTCTC AAAC ACATTGATCTTC AACTTCTC ACCTTTCAGAAGGCC 
AAGAGACAAATTTCTGTTGTTTTTGAGTAAGCAGTTTAGGGCTGGGGAGAAA 
GCCCAGTTGGTAGTATTTGCTTTTAAAGCATGAAGTCTTGAATTCAATCCCCA 
TAACCCACATTAAAAAAAAAAAGCATGGCATGCTGGTGTCTGCTTATAACCC 
CAGTGCTGAAGAGGCAGAAATGGAGGTGGGGCTGTCCCTGGGACTCACTGGC 

15 TAGCTGGATTAGCTGTAGTAGTTTCAATTAGAAATGTCCTTTATAGGCACTGA 
TATTTGAACACTTTGTCCCCAGCTGGTGGTGCTGATTGGGGAAGTTCAGAAGA 
TGCAGCCTTGCTGGAGGGTGCAGACAGACTTTGAGGGTTTATAGCCTTGCTCT 
GCTTCCAACTTACTCTCTTTCCGCTGTGCTTACAGTCAAAGATCTGATCTGTCC 
GTGGTTAAAGATGTGAGCTCGCCATGTCTTCCACTTGTTGCTATGCCTTCCTGC 

20 GATGATGGACTCGCATCCCTCTGGAACGGTGAGCCAAAATAAACCCTCTCTCC 
CATAAGTTGCTTTTGGTCATGGTATTGATCACAGCAACAGAAAGTGCACTGCA 
ACAGCTGCTGAGCTGCATCTCAGCTCCATGTTTGCATCTCTCACATTTCCTTGC 
CATTCATTTTCTTCTTTCTGTTTTTTTAAAATTAATTAATTAATTTTACACTCCA 
TATTTTATTTCCCCCCTCCATCCTCCTACTGATCCACATCCCATACCTCCTCTC 

25 CATTTCCCTGTCTCCACGTGGATGTCCCCACCCCCTACCCCACCTGACCTCTAA 
ACTCCCTGGGGCCCTTAGTCTCTTGAGGGTTAGATGCATCATCTCTGAATGAA ' 
CACAGACCTGGCAGCCCTCTACATGTACGTACATGTATGTCAGTGGCTGGTGT 
ATGCTGTCTGTTTGGTAGTCCAGTGTTTGAGAGATCTCGTGGGTCCAGATTAA 
*• TTGAGACTGCTGGTCCTCCTACAGGGTTACCTTCCTCCTCAGCTTCTTCTAAIf: 

30 CAACAACAGGGGTCAGTTGCTTCTGTCCATTCACTGGGTACAACTATCTGCCT 
CTGACACTTTCAGCTGTTTGTTGGGTCTTCTGGAAGGCAGTCATGCTAGGTCC 
CTTTTGTGAGGGCTCCATAGCCrCAGTAATAGTGTCAAGCCTTGGGACTTCCC 
CTTGAGCTGGATCCCTTTTTGGGCCTGTCACTGGACTCCTTTTTCTCAAGCTCC 
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TCTCCATTTCCATTCCTGTAATTCTTTCAGACAGGAACAAATATGGATCAGAG 
TTGTGACTGTGGGATGCCCCCTCCCTCATTTTATGCCCTGTCTTCCTGCTGGAG 
GTGGGCTCTATAAGTTCCCTCTCCGGGCATTTCATCTAAGGTCCTTTGAGTCCT 
GAGAATCTCTCACCTCCCAGGTCTCTGGTGCATTCTGGAGGGCCCCCCAATCT 

5 CCTACCTCCCGAGGTTGCCTGTTTCCATTCTTTCTGCTGGCCCTCAGGGCTTCA 
GTTCTTTTCCCTCACCCAATACCAGACCAGATTCCCCTCTCCCTCCCACTCCAC 
CCCCCCCCCATCCACTTTTCCTCCCGGGTCCCTCCTTCCCTCCCACTTGTGATT 
GCTTTCTTCTCCTTCCCAAGTGGAACTGAGGCGTCCTCACTAGGGCACTTCAG 
CTTGTTGACCTTTTTGAGTTCTGTGGACTATATTTTGGGTATTCTGTACTTTTTT 

1 0 GTTTTTTTTTAGGCTAATATCC ACTTATGATAGTATACATAAGCGACCTCAAA 
• AATTCTACCAGAGAACTTCTCCAGGTGATAAACAACTTCAGCAAAGTGGCCG 
GATATAAAATTACTCAAATAAATCAGCAGCCTTCCTTTATACAAATGATGATA 
AACAGGCTGAGAAAGAAATTAGGGAAACAACTCCCTTCACAATAGCCACAAA 
TAATATAAACTATCTTGGGGTAACTCTAACCAAACAAGTGAAAGACCGGTAT 

1 5 GGC AATATCTTC AAGTCTCTC AAGAAAGAT ATCG AAGAAGATCTC AGAAAAT 
GGAGAGATCTTCGATGCTCATGGATTGCCATTCATTTTCTAACCAAGTTCATT 
TTAAGTTCCTAACCATTCAGCGAAACCACTAGACATAGTCCGAGAACTGGCCT 
ATAATCCAGATCTGGCCGTTTCTCTTCCCAAGCCAATGGCATAGCAAGTTCAC 
TGGCCAAGGCTCTGCGCACTGAGAGGGTCCCCTCCCCACTGTCCTGGAGTGTT 

20 GTTCAAAATGGAGCAATAGTGCAGCTGTCATTAGAGGGTGTGC.CTGCACAAC 
CCTGTTGTCTTACTTAGGATTTCTATTGCAATAGAACACCAAGACCAAAACAA 
CCTGGGGAGAAAGAGGTTTATTTCATCTTGCATCTATTTCCATGTCACAGCTC 
ATCAGTGAGAGAAATCAGGGCAGGAACTCAAACAGAGCAGGAAGCTAGAGG 
CAGGACCTGAAGCAGAAGTCACGGAGGAGTGCTGCTCACTGGCTTGCTCATC 

25 CTTCTTTCTTATACCATCCAGGACCGTCCACCCAAGCGTGGCACTGCCGCCGG 
TGGGATGAATCCACTTACATCAATCATCAGTCAAGAGAATGTACCACACAGG 
CTTTTCATGGGCCAGACTGTTGGGGATATTTTCTTAATTGATGTTCCCTCTTCC 
AAAGTGATTCTAGCCCATGTCAAGTTGACATAAAACTAGTTCATGATGGACA 
GCTCAGGATGCA<.AGTAACGTGTGAGGCGTGTTCCTATGAATACCCAGTAGC 

30 AGTCAAGATCAAACTCTCATAAGGGCAGGGCCTCTCTTACAATCAAACCCTC 
ACAATCTGGCAGGACAGTCTGCAGCTATTTCCTTGTGGAATCAAGCCAAAGC 
CTCCTCCACTACAATCCCAGCATAGAACCCTCATGGGAGAACATCTAAACCTT 
TTGGGAATCCTTCCCTGTTTTGTGCCCCACACAAATCTAGAGGCCATTAGCAT 
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AGCTTAGTAAATGCTTAAGCAACACACCCAAGTAAGGAATATGAATAGAGGC 
TGTGCTTTTTTTTTTTTGGCTTGAGAAGCATGGGTCATGTGCAATGACTTATCT 
TTCCTTGTAAGAGATGtGTTGACGTGCCTAGATATC'TTATGTTAAAATTCATTT 
GAATTGACTTCCTGTCCTGACACATCAGTGTCTTACCAAGAAGTCAGTAGTTG 

5 CTACTTATTTTTCTTAACTATCAATTCTTTGTGAATTTCACATCATTCACCCCA 
GTCCCACTCATCTCCCTCCTTCATATCCAGTCTCTGCCCTTGCAACCTCCCTGC 
AAAAAGAAAAGAAAAGATAGAATTTAAAAACATCTTGTTGCAGAAGCTGTAG 
TGTGTCACAGTGTGTCCCACAACATACCCTTTTGTCTACACATCTTTACTTCCA 
AATATTCACTGCAATGAGGCATTGGTCTAGTTCAAGGTTTCTGGCTTCTGTTA 

10 CACCAATCAATACTGTATCCTCATGGGAACTCCTCTTGGCTATCCTGTTGTTGC 
CCTGTGTCATGGAGATCCCACAGCTTAGGACCTGCAGGATTGACCTTTTCACA 
CACTCTAGCAGATCAAAACCTTCCATGGGCTCATTTTATGTCTTACATTCTGTT 
TATTTTAATGCTCATCAGATGAAGCATCCTGCAGGGAAACTTGGTGGTTAGTC 
GCTGTCGACTCTAAATTGTTAGATTGTGTGTATGGGTTAGCTTAGGAGGGTTT 

15 GGATATATTGGGTCCTGAGAAGAGACTAGCTTGACTAGCAAGTGTTAGCAAT 
GGACTGATGTTTTCTGTTCATTAGTAACTGGACTTAAGCTTGCTCAGGGCTCA 
TTTGGAAATGAACTATCAGATTTTATTTCTAACAGATGTTTCCAGAGCG^^ 
AACCTGGGCAGGAGTGAGCACTTTCCAGCAAGTGGCTGAGTACTAAAAGGTG 
GCCCAGTCTAAAGAAAAAGCAGTGTTGGCTGCCTGCCTGCCTTCCCTGGGTCT 

20 GGAGCCAGTGTGTCCATCTGCTGCCACTGTCTTCTTTTGACGTCAGACTGAAG 
TTTCTTCAGCCTTCTCTAACAAGAACGAAACACCAGTGACTCTTCAGGAATTT 
TCCAGGCCTTCAGCACCAGACTGAGACTGCTGAAGCGATCAGCTTTTTGGACT 
AAACAGCCTGTTTAGTGTTGTGTAAGTCAATCTAATAAATCCTCCTTTATAAC 
ACACATTTCCTCTATTAGTTCTGTCTCCAAAGAACGCTATTTCATCCATATTGA 

25 GGTACCAGTAGTAGGTTCTAGAGAGGCAGAACTCTAAGGATGAATTTCTCTA 
GTGGGCTTGATGGTGTTTGACTTAGCTCTCCAATTCATCTACACTTGGAATTG 
CAAAGACTCTACACTTCTCTGGTAGTGTAGAGAGAGCATTACCAATCCGTCTT 
AGGCGTCCAAGAACTGTACAGTGAGTTCACACATACACACGCATACATGTGC 
ACACATGTATAAATAAACAAACAAAllAAATAAATAAATGtAACTTAAAAAT 

30 TAAAAGGAAAGAGAATGGCTCTTCCTTCCAAGAATGGATGTGGGCTGAGAGG 
TAGCAAAGGTTTGAATCTCTGATGGTGCTGAGCCAGAACAGTCATGAGCCTCT 
GTGGGTCACATAGTACTCACATAGTACCCACATGCTCCACATTTGGAGCATAA 
AGGGCAGGAGCTTTCCTCGGGGGTGCTCTGTGCCTGGTATGTGTCCTTTGTTA 
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TATGTTTCCAGCCTTCTGTTACCATTGGTCTCTTCTATATGCTAATCTTCATCCT 

cctttcccccaacatgccatgctaatggacagcagagaccaaacagtgtcca 

TAACTGATCCAACTTGGGTATCTCAGTGGTGTGTGTCCCTAAAAGTGCAATAG 
ATGGGGGTTGGGGGAGGAGCTTACTAAACACTACTCCATCTGTGTAGACAAA 

5 TTACAGAGCAAATGAAACATTTACCTTGAAAGACTGGAGAGGCAGCTCAGTG 
TTTAAGAGCACTGGCTACTCTTCCAGAGAGCCTGGGGTTCAATTCCCCCCCAC 
CCATATGGCAGTTCACAACAGTCTCTAATGGCTGTAGTCCTATGGGCTCCAAT 
GTCCTCTTTTGACTTCTGCAAGCACCAGGCATGCAAGTGGTACACAGATAGGC 
ATGCAGGCATACCACGCATACACATAAAATAAATAAAAAGATTTTAAAAGTT 

1 0 TAGTTTGAATTGTAGC AACAGAGAATCTCAGTTCCTCAACCAGTTCCCACACA 
AGTCATCTTATAGACAGAACCCCCTGACAACTGTGTCACACTTCCAAAAAGTT 
TTATGCCAGGTGGTTGTAGTGCATACCTTTAATCCCAGCACTCAGAAGCAGAG 
GCAGGTGGATCTCTATGAGTTCAAAGCCAGCCTGATCTACAGAGTGAGTTTCA 
GGACAGCCAGGGCTACACAAAGAAACCTTTTCTCAAAAACCAACCAAACAAA 

15 GTTTTATAGTCAATCAAGCCAATCCATCTTTCTCCAAAAGAACTCATGAGATT 
TGATTAGGGTAACTGTGCATCCAGGAAAAGGAAAAACAAAACAAAACAAAC 
AAACAAAAAAACTAGAATTTTTATGGCCTACTAGATACTGGCTCTGAATTGAC 
ACTGATTCTGGGAGACCCCCTCCAGAGCACTGTGGCCCTCCAGTTAAGTGGGT 
GCTTAGAGATGTGATTAATGGAGTTTCAGCTGATGTCTGACTCATAGTGGATC 

20 CATTGGGCCCAGGAACTCACCCCAGTCCCAGAGTGTATCATTGGGACAGATA 
TATTTAGAGCTTGGCAAAATCCCAACATCGGTTGTCAGATGTGTGGAATGAG 
AGCTATTATGGTCTAAAAGGCCAAATGGAAGCCATTAGTTGCCTCTACCAAA 
GAAAATAGTGAATGAAAGAAGAAAATTCCCTGGAGGTATTATAGAAATTAGT 
GCCACCACTAAGGCCTTGCAAGATGCTGGGGATGGAGTGGGGTGAGAAAATG 

25 GCTCAAGGGTTAATGGCACTTAGCTGCTCTGGCATAGGACTCAGGTTTGAGTC 
ACAGCACCCACTTGGTGGCTTATGGGTATCCATAATTCCAGTTCCAGGAGGCC 
CCTTCTATGTGGTTCATAGACACACTTGTAGGTAAAACACCATACACTTTTTTT 
TTAATGAGAGGTACTGGAGAGGTGAGTTAGAGGTTAAGAGCATTGGCTGCTC 
TTCCAAAGGACCCTAGTTCAATTCTCAGCATACCATAGCGCT(2CTCACAACAG 

30 TCTGTAGTTCCAGTTCCAGGAGATCTAATGCCCTCTCCATATGGGTACCAGGC 
ACACATGTGGTGCAGTCATATGTGCAGGCAAAACACGCATAAGCAAAGAGGT 
ATAATACATAAACAAATCTGAAAAAATTAGAAAGATGCAGGGGTGTTGGTTA 
CACCTACATGTCCCTTTAACTCTCCCTTTTATTCTGTGCAGAAGACAGATGGA 
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TCATAGAGAAAGACAGTTGACTATGGCAAACTTAATCAAGTAGTGACTCCAA 
TCACAGATGCCAACTTCCTTATTTGAGCACATTAACATATCTTCTGGCATCTG 
GTATGCAGCTATTGGTCTGGCAAGTTCTTTTTTCTTGAAACCTGTCTGCAAGG 
GCAACCCAGAAACAATCCGCTTTCAGTTGCCAAGGCTAGCAGTATACCTTTAC 
5 CTCAAGGGTATATTAACTTTCTAGCCCTGTGTCATAGTTTAGTTTGCAGGGAT 
CCTGATACACAAGGTATTCACGTTGGCCCATTACATCGATGGCATTACATTGA 
TAGGACTAAGTGACAGGGAAATGACAGGCACTTTGGACTTGTTAGTAAAACA 
TTTTGGATTTTGGTGGCAGAAAGCTCCGTACTTGGGTGTGTTGCTCCAGCCCA 
TTCACCAAATAAACTCAGAAAGCTGCTAGTTTTGAGTGGGACCCAGCTGCTGT 
1 0 GCCAGTTGCTCTGTCACTTGGATCATATGATGCAGCCCACTTGATGGTTACTT 
AAGGTGGCAATGGCAGATAGGGATGCTGTTTGGAATCTTTGGCAGGCCCTTA 
TAGGTGAGTCACAGTGGAGAGCCTTGGGATTTTGGAGCGGGCCCTATGACCA 
TCTGTAGACAACCATCCTCCCTTTGTGGGATAACTCTTGGCTTGCTATTGGGC 
CTTAGTGGAAACTTGACATTTAACAATGGGATGTCAAGGTCTGTATTTATGTT 
1 5 ATAGTTTCTCAGCCCTAAATGGGATATCCTTAATGTCGATTACCCAGGGCCAG 
GGTTATATGCAGAAGAGGTGACTGAAAGATTTTAAGATCCAGAGGTGGTAGA 
AACAGGGTTTTCCAGACAAGAAGATGGAGTCACATATGAACTCACAGATACT 
GTGTGACAGCACACACAAGATCTACACAGGTTCGAATTAGACATTGTCCCAG 
CACTGAGAAGGGAAAGTGGACCAAAGCTAACCAAGAAGCTATTAACAACTG 
20 ATAGGGCTGAGAAAGGGATAATTAGTTTTCTCCAATCGAGTATAGGTATATC 
AACCACATTCCAGGAGAGGCCCCATGCCCAGGAGTGGTTGGCCAACACAAAT 
CAGTCTCTAGTTGTTTTTGTTGTGGTGGGTTTTGTTGCTGTTTATTTGGTTGG^ 
GGTTGGTTGGTTGGTTGGTTGGTTGGTTGGTTGTTGTATCATTGGAGAGCTGG 
ATTTTATTGAGCTGAGTGTTGCCCTGAAGAAGCAGCTGTAGTACTGAAAACCA 
25 AGAAACGGTGTTGAAGGCAGGGAGCAAAAGGTCTGAGCAGAGACTGAATGG 
ACAGTGGGAGCTGTGAGAGAGGCCCAGCCAAGGACTTCTTGGAAAAGATGA 
AGTCTAGCAAAGCAAATCACAAAACCCCCCAAACAGCACTGGGAAAGGACA' 
AGGGGACAGAGAGATAGCAGGTTGAATAGAACTAGCGGTTTCTTAGGAATGA 
CTTl^AAATATGTAGCTGCGCACATCCAGCAAATGCAGTCTGGACATTTGTCCC 
30 AACGAAATGAAACTCTGTATCCACTAAGATGCACATTCACGCCTGGAGAGTC 
AGGCTCTCCTCTCACATCCTCTCTGCCAGCCAGAGGCTTACCCTAAAGCCCAG 
ATTTCCCTTGCACTTAGTCTGTAGTCATGTTGGTCTTCAACGCATAGGAACTCT 
TCTGCCTCAGCCTCATGGACGCTAGGATCCTAAGTGTGAGTCCGTAACATCTA 
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TGTTTTGTTGACAATGTCATGCATGGCTACTGTATCTATATCATTTTTACTTTT 
CCAGCTCCAGCTCCTTCTGCACCCTATTCCGCACCCCCTCAAATTCGTGACCTC 
TTCTTTAAGGATTATTGTTTTATATGTAGGTGCATAAATATACATGTATTTTTA 
AGTAAACAGAATAATATGTTTCTCTATTACCTGTAAAAAAAATTCTCC 1 1 1 1 i 1 
5 TTCATAATAACCCCATGTGTAATAGATGAAAATGTGCCTTAAAAGATTAATGG 
TTGGCTAAGTATATTCAGTACAGCTGCTATCCCAGAACTTTGTGGGTAGAGGC 
AGGAGGATCAGTTCAAAGCCATCGTCAGCTACAGAGGGAGTTCAAGGCCAGC 
CTGGGCTATAGGAGACCCTGTGCCAGTCCTTCCCAAAGGAAGATGGAGAATG 
TAGAGATGAGGAGCAGATTAGTGGTTGTGTAAGAGAGGTGGGAACTGTGGCT 
10 TTATTTGGGAAGCATCAGGGATAAACACACATGCACACACACACACACATAC 
ATGCACACATACACATATACATGCACACACATAGCACACATATACACATACA 
TGCATACACACATAGCACACACATACACATACTTGCATACACACACAACACA 
CACATATATACATATATGCACACACAGCATACTCATATACACATCCATGTATA 
CACACAGCACACACATAATACACACACATACATACATACATACATACATACA 
15 CACACAGCACACACACACATTCACATGTACATGCACACACAAATACACACAT 
ATACATGCTTGCACACATAAATACCCACGAATGCAGGTGAATCTGCTGAAGT 
CTGAGTCATCTGTATGCATTGGACAAATGTCGATTCTGCTTGTTATCATGGTA 
TGTACGCAAGATGTCTGCCTTGGAGGAGGCCTGAAGGAATAATGAACTAAGA 
CTTCCTGTGTACTTCTTGGGAATCTATAATTGTCTCAAAATATAGTGTTCAAGC 
20 TAGACTTCAGGCCATGAAAGTTGAAAATGTCAACCAAATCAAAATATTGCTG 
GAAAAAATGTAAATGGACAAAGTTAACTCAAAATAGATAAAATGGGGCTGG 
AGAGATGCTTCAGTGGTTAAGAGCACTTGCAGCTTTACAGAATACCTGAGTTC 
GGTTCCCAGCACCCAGGTGGGGCATCTCACCACTCATAACTCCAGCtCTGGGG 
AATCTAATGCCTAGGGCCACCTTTGGAGGGAGTCAGGCTGTGAGGTTTAGGG 
25 TTGCTGCTTCCTGTGTTGTCCCTGTGTTCTGGTCCATGAAGATAGAGCGAGTA 
GCTGCTACTTGCCCACTGCAGTGGTGCCAGTGTGACACCTTCCCTGACACATG 
CTCCATCTTGACAGACTCTGCCATCTTAATCCATGAGCCAGCACAAACCCTTC 
TTTTCTTCTTGCTTTTTCTCAAGCACATAGTCATGGCACTAAAAGAAATATCTA 
ACAGAGGGGGAAATGATaAACATTCACACAGGAAAQGGAAATGTGCAGCTG 
30 ACCCCTGGTGACTTCAGCAGATACTTGGTACCTGGGCGTGAGCTGTGAGGAG 
CATGGAGCCTCATCAGGGGAACCCACACTCTCACCCTGTGACAAGGAAGCAG 
- AAAGAAGGGAGTACTGCGGAGAGCGCCCAGGCCCCAGCTGCTGAGCCAGCCT 
CCTGCAGCAGGGCCGTGTTGAGGGGACAGGGTTTGAGGGTAGGTGAGAGAG 
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GAAACTGGCATAGATGTTAGACAGAGGAGGCATGCTCACGGTGCAGCTTGGT 
TGGGTCTGGGCTTGAGGAAATCAGCAGCAAGAGCCATCTGGGGCCATGGGAA 
AACTGAATTTGGGTTATGTTTTCAATGTGACTAGAGAGTTATTATGAACATAG 
CAGATGTGACAGTGATGGATTGCAGCTCTCTGGGAGCACACCCTCCATATTTG 
5 GAGGTGACTGCTGGATTTGCTAAACGAACATATGTGTGTTTACAATTTACTTT 
AATCCTAAATGCTTAGTTGCCATTACATTTAGGAACACAGACATACGCAGAG 
AGAAAGCGAGACAGAGAGAGACAGAAAGACGACAGACACAGGCAGAGAGA 
CACAGACAAACACAGACACACACATGCACTTACATGCATACACACATACCCA 
CACACAGAGAAACACACACATGCTTTCTTGAAGGGCTTGGCACCTATCACAG 
10 TGAGCTGGCAAATATACCTCTGCATATTAGGCCAGCAGACTGGAAACTCAAG 
TGCGGTTTTTTTATGGCTCAGGCTGACATCAAATTGCTTATCTGGGAAATGAA 
AGTGTCTGCCCCTCAGGCTTCCATACACTGGACAAGATGCTCTGGGGATCACC 
TGATGTCCCAGGTTGATTTCTATTGGTGTGACAAATACTATAACCAAAACCAC 
TATAACCTAGGGAACAAAAGATTGATTTCACCTTACAGCTTCTAAGTCCCTTA 
1 5 CGGAGGGAAGCCAGGCAAGAAACTCAAGGC AGGAGCTGAAGAACAGGCCAT 
AGAGGAACAATGTTTATTGGCTTGCTCCTCATGGCITGGACAGCCTGCTTTCT 
TATACAACCCAGGACCACCTGTCCATGAATGGTGCCACTCACAGTGGCCTGG 
CCCCTCTTCAATCATTAATCAAGAAAATGCCCCACAGAAATTGCCTACAGGCC 
CATGGGATGGCAGGTCCTTCTTTCCATGTGACTTTAGTTTGTGTTGATGAAAT 
20 CTAACCAGCACCCTTGCCTAGTTATTCCTCACTTGGATAAAACATGCTCACAG 
CAGCAAGGTTTCGAACCCGCATCTGGGCAGCATGTCCTGGGCCAATAGTTACT 
TCTCTCTGTGTGTTGTGGCCAATAACTGACAAACATCAACTTAAAGGAAGAA 
GGATTTGTTGTGGCTCACAGTTCAGAGAATGTAGCGTGTACCTTGACAGAGA 
GCCCCTTGGTCCATGGTGCTGGCTAGAC AGCAGAGAGATGGGAACTCCCAAC 
25 TCAGCTGGCTTTCTCCCTTTTACTTTTTATTTAGTTTGGAAGCCCGGCCTATGG 
GATGGTGTGGCCACATTCAGGGTGTATCTTCTCTTGTTAGTAACACCAGAAAC 
ACCAAAGGTACTTCGTTAATGCCCCCCAAGTGCl^ATTTTTAGTTTGTATTTA 
TTTATTTGTATGTGTATGCCATGAAGAGAGTGGAAGGAGCCGCCCATGATGG 
CACACAGCTTTAATCCCAGCAATTGGGAGGTAGAGGTAAAGGGACCTCTGTG' 
30 AGTTCCAGGATGTTTATGGGCAGCACTGTGATCTCTATGGAGATACACTAGCA 
TCCCCCATGAACCAGCAAGACAAGATGCATTATCCCATCCATCAGTTGACAG 
GAAGCACAGCAGGTGGCTTGTGAAATGGAAACAGACATGGCCAAAACTTGTG 
CAAACITAGTTAACCAGCAGAGAAATGGGAAATCAAACCCAGAAAGCAATA 
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GCAATTTATACCAATGACTGACATACACAAGTTCTAAATGTGTAAGTAAGAG 
CACATATGGACCGGGGCACCCGCATGTGGGTGGTGCATTTTCAATCTGTAACC 
AACACTTGGGAAACAATTGGCAGCTTCCTTTGGGGTGCAAGCTGACACCTGC 
CAGTGACTCAGTCTCCTGTTGTCCACCACCCTGAAGGAGGTTGCTTCTTTAGA 

5 TTTGTGAGATGTTCTGGAGGACTTTCCAGTTGTCTTAATGTAAAGCCACTGGC 
TCCCTTTTATCTGTGTTGACATTCTTAGCAAATGATTTTACCCTGTGCCCTATA 
TGCCTTTATTTCTTGCTTTTCTCTGGCCAAGCTATTTTTTTCCAGATCATTGTAT 
CCTGGGATCCTTTGCTCCTGAGCCTGTTAATCTGAGAGAAAGCACTCATTGAA 
ACTGTTCTTGACTCATTCCTGCTGAAGTGACTAATGCCTTGACAAGCAAAACC 

10 ACTTAGGGAAGAGAAAGGTTTATTTGGCTAGAAATTTCAAGTCACCGCCTATC ; 
ATTTCAGGGAAGTCAACGCAGGAATGTAAGTAGGTGGTAACATCACATCCAT 
AGTCAAGAGAACAAGGAAATCAACTCGTTCAGGCCTCCTTGGTTGATAGCCC 
TCAAATAGTTTTCTCTCTCTTCTTATTGTTCAGGATCACCTGCCTAGGGAATGG 
TGCCGCCTGCAGTGGTCTGGGTCTTCCCACACCAATAGGCAACCAAGACAATT 

1 5 CCTC AC AG AC ATGTCTACTGGCCTACCTGAGCTAGATAATTCCTe ATTACAAC 
CCTTGCCAAAATGCTTTAAGTTGGCAATTCAAAGTAACCAGCATAGGATTGCA 
GAGGGAGCTCAGTAGCTAGGAGCACATCCTGCTCTTTGAGAGAACCAGAGTT 
TGATTCCTAGCACCTACATTATGTGGCTTATACCCACCTACAACTCCACTTTCA 
GAAGATCTAACTCCCTTTACTCGTCTTTGTGGGGTCCCACATATATACGGCAT 

20 ATACACATGCGCGCGCGCACACACACACACACACACACACTGGTGTTCAAAT 
GCCCGAGTCTATGGGATGTATCTCTCATTGAAACCACCAAACCAACCAGTTCA 
AACACTTATGCCACATCTTGAATGCTGTACCAGGTTGAGATTCTTCCAGTAAA 
CAAACACGTTCTCTACTCTTACTTCACTCTCCATAGAACCTCAGCTCATAGAC 
AAGAGCCATCCAAGTTCTTTGCTGCAAGTTAAAGATGCTGGTCTCAGTTTCAA 

25 GTGCAATCCTCATTTCTAACAGAGATCTCATTAGACTGGCCTTAACTGACAAA 
GTTTCTTCTGCCTGCATCCTAATTACAACCATGTAGTCAGTCCTTAACATGTCC 
TTGCTCITCCTTGATCTTCTGTCTTCTGTGCCTTTACCAGAATTATCTACATTGC 
CCCACTCAAAACAGTTTAGTCTCTTTTCCTTAAAAAAATTCTAATTGTGTGTAT 
ATATGGTGTCAGTGTATGCATGTAAGACCAGAGAGGGAGTTGAATTCCCrAG 

30 AGCTGAAGAAAGGTATGAACCACCACCACCCAGCCTGAAAATAGATCCCCCC 
TCAGTAATATATTATGATTGGTTTCCTTTCATTCCTCCCAGTTCCTCTTCCCCA 
ACCCCTATCTAGAACCGCCCACTTTCTGTCTCTCTTTAGAAAACAAACAGGCT 
TCTAGGGGATAATAGTAAAAATATAACAAAGTACAATATAATGAGATAAAAC 
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AAAAATTATGTCATTGGGGTTGGACAAGACAAACCAACAAAAGGAAAAGAA 

CCCAGGAGAGGCACAAGAGTCAGAGACCCACTCATTTGCTCACTCAGGAATC 

CCACAAAGCTCTGTGGCACCATTCCCCTAGCATATGTTGCAGCATGGACACCA 

TTGCAGATCAAAGGGTTTGTAGCTGGGTCGGTATTGATGTTTCTTTTTTGGTA 

GCATGCAGAGTACCTCCTGTACTAAAGATGCTAGGATGTGGGGATGAAGGCT 

CTTATAGACACTAGCTTGACTTGTCCATATTCAATGAGTTCTGTAGGTGTTGTC 

TTCAGCAATAGGGTTTTGCTGTCTCAAGTTTTTGGGGGAGCAACCTACTGTTT 

TGGCAACAGCCTGGGTTGTTTGGGGTTTCCCATGGGACCCCTTTGGCCAACAA 

CTTAATTAGATGTAACCCAATTCTGTTATTGGAAGCTTCATTTGGTATCAAGA 

GATGGCCAGATGAGACTGTCTCCCTTATTATTTGATATTTCACTTAGATTGCTT 

TCATATGTTCTCCCTCATCCGCTTCTCTCCAGCCCCTCCCCACTTGATCCTCTT 

GTTTC AGCACCTCCCCC ATCCATTCATAACTATCTTTTCTACTTCCC 1 1 i 1 1 AA 

ATGAGAACTACTTGTTAGTACCAGTTTTCTCTATCAGTTGGCATTACATCACG 

ATGATGAAATACCTGTGCTTTTGAGGAAATTAATTTTGTTCATGATTTTGAAA 

GGGCAAGGTTAAGGGGCTTTCCTCTGGTGATGGCCTCCTTGCTTCAGAGTCCT 

GGGACACCAAAGGGCATTCCCACAACAAGAAACAGGATGAGTGTAAATCAG 

TCAGGAAACTTAAGGGGCCTTACCCTTCAGACCCTATTATCACATCATCTTGA 

CCTGTGGGCATGACTGTTAGGGGGTTTATGTGGAAAGACCTGGCCCACTGTG 

GGCAATACCATTCCCTAGGCAGGTGGTCCAGAACAATAACAGGAAGGACAAG 

GTGATCTGAGAGCAACCAAGCAAATAGACCTGCATACATTTATTTCTCTCTGT 

TCCCAGGGGCCCTGTTTCTGAACACCACAGTCAGATTAAGTTTGTATCCTTTT 

AATCCTCAGAACGAAGATCAAACTTGGAGACAACATTAAACTTCAATTCAAC 

TGCATCCAAATCATATAATTACTAAAGGACTAATCTTAAAGCCAAAGGCAGA 

AGAGCACGGGGAGGCTGCCCTGGGATTCACTGAAGTTGCCATTCATACAGCT 

AATGATCAGGGGCACTCTGTGGAATGTGACTTCTTAGGAATCATTTAAGATCG 

CATTGGGAGATTGCATTGCAAAATTACATGGTGTCCTGTGGCATATAATATGT 

GTAGCCAATGTGTGGTGATGCTTTCCCCGCAAACCAGAATCCCCCAGCTGGA 

AGCCTCAGGGGTAGCTGTGGGGGCACCACTTCTTTCAATAAGAGCGAATGAA 

CCATTCTAGGAGTTATGTATGCTTCACGCCTTCAAAGACATTITCCTGTGTTGG 

AACTCCTAGTAGCCAACAAAGGAAGATCCTGGTCAGAGAACAGAATGTGCAG 

CAGAGCTGGACACCCGGTGGCACCTGGCTGCTTTGGGCTCCTCCATGCTGGGT 

TAAGAGACCCACAAGAGTGCTGGCTGAGTCGTGCTAAAATGAAGAACTAAGG 

GTGCTGCTGATGCTCATTCTAGGCAGGTTTGGGTAGTCTGGGCTCTGTTGGCC 
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CTCCACAGCAGTAATAAAAGAAATAGCAACAAGAGTCAAGAAGAGATTCTCC 
CAGGAATGAAAAACTGGAAAACAAGACAAACCATGAATGGTTAAAGCCATG 
CAGACTGGCTCGGTAATGAGCTGCAAGGATGTGCATTCTCTTACAGCATCCAG 
TGGGTGTTTTTTGTTTTGTTTTGTTTGTGCTGTATTGTGTATATTTTACAATCTC 
5 TATCTCACTTTTCTTCTTTTTTCCCCTCCTGTCCCACCACCATTGTATATGGGAT 
CCTATATTCCTTTCTCCTTCCCTCCTTTGCACTCCCTTTCTGTGGTTTTGTTTTG 
ATGTTGTTGTTTCTTTGTTCATTCATTTGTTTTTGAAGTGGAGTCTTTCTGTTTA 
GTCTAGCCTGGCCTCAAAATCACAATCCTCTTGCCTCTGCCTCTTTGAGTTTGA 
ATTACACACACACACACACACACACACACACACACACACACACACACACAAA 
10 CACACACCACCTTGTTCTGAGTGTTTGCTAATTTTTATCAACTTTATACAAACC 
TAGACATATTTGGGAAGAGGGAAGGAATTTCAGCCGAGGAGTTGCTTGGATC 
AGATTGTTCTGTAGGCAAGTCTGTGAGGGCATTTTCTTGATTAGTGATGACTC 
GGGGAGGGCTCAGCCTCCTGTGGGTGGAGCTCCCTCTGGGCTGTATAAGGAA 
GGAGGCTGTGCAAACCATGAGGTACCAACTAGTCAGCAGTGTGCCTATGTGG 
15 TTTCTGTTTGAGTCTCTGCCTCCAGGTTCCTGTTCTGCTTGAATTCCTGCATTG 
GCTTTCATTAATGATGTATTGTTACCTGGAAGTATAAGTCAAATAAATCCTAT 
CCTCCCTGAATTGGTTTCAGTGTCTAATCACCAGACTAGGGCACTCTGCATAT 
TTGTGCTTCCTTTCCTGTAGATTTCATAGGGCGAAGTACAGTGAGATATAGAA 
GCACTACAACCAAAGAAACACTTGCCAGCAGAGACCCTGGACTTCTATCCGG 
20 GGAACCAAATACTTGTGACACTGGAGAATCTCCCTAGGGAGAAACCCTGTGT 
GTTTGCAGATGTGGAGTGCCTTATTCAAAGAGAGAGTCCTCTTATTTGGGGAA 
GTGAATGTGTGGGAAGGAGGAGGCGTGAGCATCAGAAACACAAAGAAGGTT 
TAACAGGCCTTTCCCATGTGTTGGAAACAAATTGCAGGATGGCTCTGATTTGG 
GGGACCCTCTCTTGCTGGGTCTGGGTATAAGATGAGGTAGGTTCCTTCCTCAG 
25 GGGAAGTTTTCTTACGGGCTGCATTTGGGGCCTCAGTCTTGATTAGGTCAATG 
TGTCTCCTTGGGACTTAGCAGCTTGCACAAATTGTGTGAATATTTGAGAGTAT 
TCAAGTACAGGTGACAGTCTTCATAGGGGTGGGTGAACCATCAGTGACAGTA 
ACTACAGGAGGCAGGAGTTTTAGATTTTACCCCCACCATGCTCTGTGACAATC 
CCTCAGTATGTGGTCATGAGGCTACTGTAAACATGCTAAGAGTTCTCTTACCA 
30 TGCAAAATGGATGAACCATGTGGATCCTATGCTGAGATAGCCACTCCCAGGG 
GATCCTTGAACTCACCAAATCTACAGAAAGCAGGGCGGGAGCCAGGAGCTGG 
GGAGGAAGCAGGGAGTCAGTGCTGAAAGGGGACCGAGTTTCAGTATGAGAA 
GATGGAAAGTTCTGGAGGCAGATGGTGGGGGCGGATGTTAATTCTGCTGAGC 
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TGTGCACTTGAATGTGGTCAAATGGTCAGTGTCAGGCACAGTGCATTTTACCA 
GCATTAGAAGAAAGAATTCTTGTTCAGACAAGGAGATCATCATGTGGCCCTTT 
AAGAGCTTCACCAGATGGTGGGAGCCAGAGAATGAGCAGCATACAGGAAGG 
CTCAGCCAGCCCTATGGGGATGGAGCAGAGTGAACCAGAGTCACAGAGCAG 
AAAGCAGTCACCAAGGAGAGAAGGGCCACAAGCCCACAGCTGCAATAAATT 
AGCTTCTTCCAGCAGCCAGAGTGAGCCCAGATGGGGGAATCTTCTAGAAGCC 
TCTAAGTAAGGCCCAGCTAACATGTCCTCTTGCCTCATGGCCTCTTGGCCTCA 
TGAGAGGTTGAGTGGAGAACCTGCCCTCCATCTGGCTCTGACCCACAATCAA 
ACCTAGTGGATGGTTGCTACTTTGTTACTGTTATGAGAACACACATACACACA 
CCATATGTATATCACACACACACACATACATACACACTACACATATGTATGTA 
CACTGCACATACATACCACATACACACCCACACATACATACACAATATACCA 
TATAGCACACACCTACTCACACACTGTGCATATAACACCACATACACACTACA 
CACATACAGACATCATACACATACCACACCACCACACATGCACCCAGACACA 
CCACACACACACATACACACACTACACTTATACACCACACACACATATATAC 
ATACACTACATACACAGACACACCATACACACATACACACATACCATATACA 
CACCACATGCATTCATACACACACCACATACATACATACACCACACATACAT 
ACACACTACACATATACAACCACACACATACATACTGCACACCACATATACA 
CATACATCACACAGACATACACACCATATATACCACACATATACACAAACAC 
ACACCACATACACACACACACACACACACACAAACATACACTCATTGCTATA 
GGCATGCCCCCCCACTAGGGAAGTGTTCAGGGGCTGGATTTCTTGGAAGTGA 
TTAGATTGTGAGAACTGTGACCTAACCAGTGGATTAATTCATGGAGGGGTCAT 
AATGCAATGATATTTTAAAAAGAAGACTAGAAGGAGAGGCCAACTTTGAGGA 
AGAGGTCACTGGGCTGTGCCTTTGAGAAGGGATACCCTTCTCCCTTGGCCCTT 
CTCTCTGAATATGGTCCTATTGTGGGGCCTGTAGGAAGTGGGGTCTGGCTGAC 
AGAAGTAGATCAGAAGGGTGGGCTTTTGAAGGTTAACCTACCCCTCGTTCCGT 
TCTACCTGTTCTCTGCACTACTTTTCTCTGGGGAACATGAGCAAGAAACTGGT 
CACCCCAGGTGAGGCCATGGACAGTAGACCAAAGAAAGGATTCTACCCAAGT 
TTCAATTAGAATCAATGGGTTTATTAGGGAGAGGAGTTACTTATAAGGAGTGT 
GGGTAATCCCAAACACGCATCAACTAAAGGCCTCA^CTCAGGGATTCTGGAT 
30 CATGGAGTCCTGGCTTCAGCTAACTTGTCACAGTCCAGCATCTCCCAAGTCCC 
TAACAGTTACTGAGCAGGAGGGAGAGGTAGCTGGAATCCAGGTGAAGATAC 
AGTGACCTCTAAACTAGGAAGCATTAGCAGCCCCATGGCCATTTCCAAAACA 
TGGCTGTCACCATGGGGTTCTGCACATTTCAGTAGCATGACTACAGAGCCAAA 
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GTCATGCTTTTCTCCATGATGACCCACAGGAAATTGGTTGACCTCAACATCTA 
GCTTCAACCTTATGTTCCTGATGCCATAGACATAGCCACTCCTCCATGCCTCC 
CCTGTTGTGATGGACCAGAGTCTGTCTCTGAAATCATGAGCCAGAATAAATCG 
TTCCTGTTTCTGGTAAGTATCTGGCCATAACAATGTAAGATTAACTAACACAC 

5 CTGGTGGGATATTCTCCCCTTCTCTTGCCCCACCATGTGCAAGACAGAGTTTA 
TCTACTCCTTTGAGGATCATGTGAAGCCAAAGACTTCATCTCTGTGGGCAAGC 
AGGCAGCAAGCATCCTGCTAGACCCAATTGTCAGCAAACACTCATGCTTTAG 
AGGGTTGTTGAACAGTTAGCAAACATGTGCACTTCAGCAGGGCATTGAGGCA 
CCAGCCCATCCTAGGTCCATCTGAAGAGGCTGGGTTCCTGGGTCTGGAAACC 

1 0 ATGGG ATCTG AGAAGCAAAAGG AGACAGTGAGGTTGTTCTACATGTACCAGA 
AAGAACTGGGGATCTGTTCCTGAGAGTTTTGCAGGGACTATGAGGCTGATGT 
AGAACATTTGGAAAGAACTGCAAATCCATCCCAGAATGGATGAGAGGGGATG 
GTGAGGCTGCTTAGAATGGCCTAGAAGGTACTGTAAGAGGGACTGGGGTATA 
ATTTAGTCAGAGGCATGCACACATGCCCCTCTTCCTGCCCATTAAATAAATAA 

1 5 TT AATGAATAAATGT AGAAAAT ACTTGCCTTGCC AGC AT AAGGTTCTCAGTTT 
GATCACTAAAGCCATAAAAATGCCAGGCATGGTGGTGCATGCTTGTAATTCC 
AGTATTGGGGAGGTAGAGATAGAAAGCTCTCTGGGGCTCTCTAACCAGTCAG 
TCTAGTTTAACTGGTGAGCTCCAGGCCAGTGAAAGACTCTATTTTAAAGCAGG 
ACAGCATTCTGGAAGATGACAATCAGTATTGTCCACCAACATGTGTGTGCAC 
. 20 AATATGGGCATTTAAAAATATGTCCCTAAATGCTCTGATGGGAATGGAGAGG 
CTATTTTCAGAACATAGAGAGGCAATGAAGATGCTGGACATAGGGATCCAGA 
TGCTAGGGCATGGGGGGTGGGGGTGGGTAAATCTGGCCTACCACCCACCAAG 
GAAGGTCAAGTGCTCTGGGAACAGAAATTTCCTGGCTGCTGACACTGGGTCC 
TTCTTGACTCTGTGCATTAAGAGTCCTGGGTTCCTTAAGACCAGATGTCCTGG 

25 TCTGGGAATTCTACTGGGCAGTTTGGGGGTGGGGTCGGAGGGGGCTGGAATC 
TTCATATCTCTGTGTTGGTGGCAGGGTGACATCAGGTCTATCAGAGACTGGAA 
GTGCTCAGAAGAGAAGAGGGCATGTGGAGGCCACCCTTGTCTTCCAGGGCAG 
AGGTAACACATCGGTGGTGGGAGAGAAAGATACCCCAGGATGCTGTGGTGCT 

I gtacgacccttctggaaagagaggggcctaccttggtggtgtcagggagt.It 
30 ctgccttttctatccaagccatggtaggagtgaatggcaaaccacaggaagg 

CACAGAGCATCATAGCATATACCTGGGACAGTCAGAAGAGATGTCAAGGAGT 

caatagccaccaacaaccctgggggaagattcttaccacttccagggaagtt 

CCTAGGATTGGAAAATGGAGGGCAAGAGCTAGCCACTGGGCCGCCTCCCTGG 
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• GTCAGGGGCTGGTTCCACCCTGCCTACTGGGTCTGGAAGATCCTGGTAGTGCC 
ACGACTAGAGCATTGGCCTGGGGGCAATAACCCAGGGACTGCTTCTACCTGG 
GAAGGGGCAAGGCAGGGCAGGATAGAGGGAGGGCCAGGAAGGAACTGGTG 
GCTACCGTGAGGGCCAGGCTGGTGCAGGTCAGGATGGAAGCAATACTGTAGT 
5 GTGTCCACAGAACGTTCCTGATGCTCTGGAGGCAGTCCTGGACAAAGGCAGC 
ATGCATGGTTGCCTACCCTGGCTGGCTCTCCCTGCACAGTGTGCTCAGCTTCC 
CCCAGTTCCACGCTCCTCCCAGAAGCCTCTGGAGCTGCATGGTTTGCTCAACC 
AGTGCTGAGGCCAGGACAGTCACGGTATGCACCTTCCCAAGCAGAACTACTG 
AGCACTCTGGGAAAGAGGCTTCATGAACCTTGAGCCTGGGGGCATGGAGCCT 
10 GTCTGTGGGTACAGTGGCCTACTCCCTGGGGGGACAGGTAGGACCTTCCTCTT 
TTGTTCTAGCTGCCCCGAGCTCCTCCAGTTGGCCACTGTTTGTCCCCAGGCAG 
TCGGGGGAAGGGGGAATGTACCACAGTCCCCCAGAGGCTGCTGTGGAGGGTA 
GAAAATTTGATGCTGCAGGTTTCTTCATTGTAGGTATGACATTAACAGTCCCT 
CCCTCTCCTCGCCCCTCCCCCTTCCCTCCCCTTCCCTCCCCCTTCTCCTCCCTCT 
15 CCTCCTCCCCTCCCTCTTCCTCTTCCCCTCCTCCCTCCCCCTCCCTCTCCCTCTC 
TCCCTCTCTCAACAGGGAGGTGACCATTGGCATCTGGGACATTCATATGCTGG 
CATTTTGTTGGGTTTAGGTTGAAATGTTTGGGGAGGGTGAGGATGGGATGGA 
CCGAGTGGAGCCGCACAGGGTGCGCTCTATTGCATTTTGGCAAGCAAGTGGT 
CCTTTCGGCCCTTACTGGCTTGAGCTGGCCTTTGTTTCTCTCCCACCAGGGATC 
20 CTTTGCAACCAGCTGTGGTACCTTGCCACCCTGGGGGTAGGAGGCAGGCACT 
GAACTATTCTGTATTTGGGACCCGTGGACTGTTCTTGGGTGGATGGGGCTCCC 
TGACTCAGCCCCATCCCAGCAGGCCAGCAGCTCCCTTCCCTTCTGGGCCTTGA 
GCCATTCCCCACCCTCTTGTTTCTTGGAGGGTAGCCAGACATCTCTCCCCATG 
AGATAGGGCTGGTTTGTTTCCAGAACAGCAGGGGGATGGGACAAAGAGGGG 
25 GCTGGGTAAGAAGAGAGAGCCAGGAGGGGAGAGTGAGTCAAAGCCTAGCCC 
TTAGGGTAGAAGGGCTATTCCATGCTCTTACCCTAGTACTTTCTGCTCTm 
GATCCCTGGGGACCCCTTCCTACTCTCTCTAAATACCTGATCACCCTGTTATGT 
AGAGCCTTAGCTTTGAGCTTCTAGATGCTTCAGAGGGGAGGGAAGGGGAGGG 
GAGATTCTTGA|GGCCTGGGGAAGCCTCATAAAGGAGGAATGAGTGGGTGGG 
30 GCCCATGCCTGCTGCTAGCCTGCAAAAGAGAGTTGGGCCTAGGGTTAAAGTC 
AAGCTGGCCATCGCATTAGTTTCCCTCCCTGTGGCCCCCTATCAAGGGGTGGG 
GTGCATTCCTAAGACTCTAGGGTAGGTCCCAAGATGTGAGGGGCCAAAGAAG 
ATGAAGATGTCTCTTCAGAGGTAGAACCTGAAGTGCGCAGCCCTTACCAGAC 
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CGACCTGAGTCCAGAAGCATTTATCCCCCCCCTCCTCCTCACCTCTCTCATGG 
CCTCCCGGCCCTGACACATGATGGCTCCAGTGCTCACCAGAAGCCCAAAAGG 
AGATTTCTTCCCACAACACAGAAACTGCAAGACAGACCAGGGAGGAGCATAG 
CGTAGGTAGGACTGATTGATGGCAAAACTAAGGCATCAAACACAAGTACTTT 

5 GGGTGATTCTGCAAACTGGGCTCCCCAAAGACCAGAGCCAGGGCTTAGGATC 
ACAGCTGCTCCTGGAAAAATAGCCACCAGTGGTTTGGCCTGGTAATAGAATA 
GCCTGGCTTTCTTAGGACTACTTCCCCTTACAGCAGGGCCCTGCTACTTGCCT 
GGTGGCTAAACACCTACTTCCTTGAAGAATACCCAATTCCTTCTTGGAATTCT 
CAGTCCCTACTGAACAGGGGCATTGCCCTTTTATTTTGCACTAGCCATACATT 

10 GGGTAGCCAGTCCTATTTGGGGTGGAGGGGAATGGGTGGGAGCTCAGTCATC 
TACTCAAAATGAACCTGCGTTCTGAGACCTCCTCTTGAAGCCTTGCCAGGATG 
GCTGACTCTACAGGATTTCTGGCATCCTTTCTCTGGAGCACAGCAAGTATGAC 
TACCCTGTCCTTAGCTCACCGTGTCCTGGATGACGGCCAGCTCCTGCCACCAG 
TTGCTAGATGGGCTCTTCATTGCCTGATCATACACGAAGTCATAGGTGTCCAA 

15 CACCGCATCCTCCACCTGCACCAGAGCATCATAGGAGGCAATGCCCTGAGAG 
AGACCCAGGGAAGCCCACATCCTGGGACTCCTTCTCCAAAGACTTTCCTCCTA 
CCATGGCAATCCTGTGGCTTGCCTATTGCCCTTGGGGCTCCCAAAGGTTAAAG 
AAGAGGTCAGCAGAACTTAGCTACCCACCCTCAGTGGTAGCTTCTGGGTAGC 
ATAGAGTGGTATCCTGGTGCTACTGCTGGCCTGGATATAGGCAATAAGCCCA 

20 GCTTGCTGAGAAGTGTGGATAGAGGCCTGGACTAGTTCCTTGGTACCCTAATG 
TACTTGAGGGTCTTTCTCCAGAACTACCCAGAATACTGGGAAGATAAGCTCTA 
AGTCTTGGACCAAGGAAGGTAAAAGTATCTGGCATCAGATACAGGTAGCCAA 
GGAAGAGGAAGAAGGATGTTGGTCCTGCCTTTGCTCTACACACAGAAAGCCT 
AGACTGTTTTGCrCCTATCTCCTAGGTCTCTTTCCCTATTCTGTTCAATCCCTAC 

25 CCTAAGATGCTGTCCCCCAGGCCAAGTATGTTTTCAGAGACAAGCTGGAATTT 
AATCTCTTCCAGGAATGCATCCTGGATATGTGGGGCTCACTGGTCTCCTGACC 
TAGGGTAAGTCCTTCTACATCTTTTTGCCTTGAAGACCCTGAGTCCTCCCTACA 
CCCAGCCTCATGCCAGGCCAGCAGCCAGGTCACACAAATGGTTGTCCTAAAT 
GAGTGGAGTGGTGTTGAAGTCCTATA^rCCTCTGGTAACTCATGCTTTTCAGAA 

30 AGGCAGGGTGTCTGTTCCATGGCTTGGTGTCCTCTATGGGCCTCACTCGCCAG 
CTGCCTGCAGGTCATAGTGGACAGGTGGACAGTTAGAGGTCCTTAGTTAGTCC 
AGCCTGGGCCCAAGGCCCCTGCAGGAGCCAGGGGAACTCCTTGCTAACTGCT 
GATCCTGGCACCGATTCCTCCCTCTCCATCCATGCAACAAAGCCAGACATCCC 
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CATGAGTCATCCCAGCTCAGCCTGCCCCAGCCCTCCCCCACTGGAAGGAGGTG 
CCAAGCTGGCCTTGGTTGCTGGTCAGTCAAGCCCAGAACCCTTTCATGGGCTC 
AGCTTCTTTGAGCTTATGGGGCTACTGTGTCCTTAAGGGAACAAATCCTGGGG 
AAGGCTGAGAACTGTCTGGAGTTATTCTGGAATGGGTGGGGCAGGTAGCAGA 
5 GTCGAGCAGGCCCGATTGAACTAGCCCTTTTTGCATCACTCCCTAGCACAATT 
GACTTCATTTCCATGAGTTCTACCACTTTTCAAGGATGTCCCAGTGCCCCGGG 
GCCCTCTGTCCACTGTCATTCCTACAGGACAATAGGGATGACATCAGCTGTAG 
AGCTCAGCTCAGCTTGGTTCGCCCAGCATCCCACCCCCAATGTGCAGAACAAC 
AGAGCCCCGCCTAGACCTTAGAACCTGATCTTGGAGCTGTACAGACCAGGCT 

1 0 GAACTCTCTCTGCAGAGCCC AAGC ACTGGGGCAGGGGGTAGTGC ATGGCCGT 
CTGGGCATTTTGGCATCCAGAGTGCTCTGTAAGAGACTGATGCTGGAGTTTTT 
CATGGGAATAGGGAGGGTGGACCAGAGGCTTCCTAAACCAGGGCCACAAAG 
CCTCCACACCATTTGTCCAGAAGTCCCTCCTAATAAGAATACAGACTGCAGGG 
CTGGCAAGCAGCTTCTGGGTTACAAACTCTCCCTTGTAAGTGTTTTGTCCAGG 

1 5 GTAGAGGGTGGAGGAAACCCTGAGTGTATGAAGTCCTGAGTGGGTATTTGTT 
GTCAAGGACTACAGCCTCATGAGAAGTTGGCAAAGCCATCCACTGTGTGAGT 
CTCACTAAGAATAGCTAAAGCTGCCTTCTGTCCAGCTGTATACATAAACCCTG 
TCTTCTTGTTTATCTGTATGTAATAACCCTGCCTCTTTATTCAGCTGTATATAA. 
TAAACACACCGAGCTGCTGGGGAAAGCAGTTTTTCCATCCGAGAGTTCAAGC 

20 CACCTGATGCCAGCTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTC 
TCTCTCTCTCTCTCTCTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT 
GTGTTTGCACACATGCGCACCTGTCCTTTCTTTAGTCCTTTGTCAACCCTAGCC 
CCCATCAGGCTAATCCTTGCAGCCCAAGTTATTGCAGATTGAGACCTCATGTC 
CAGGGTACTACCCTATCTTTCTATCTGCCTGGGCTTGGTGACTCCCCTCCCCCA 

25 TGTTTGCCTGCTAGGGTGATATAGATGGGCCACACAGCCAAGGTCAGGAGGA 
CCCCATGTGGAGCTGCAGGCAGCTTGTGGACCAGCACTGGGTCCTGAGGCTG 
CAGAGGCAAGCCTGTTTTATGCTTGGTCTTGACATGAATTATTAAGGAGAGCT 
AAACAATTCATTTTCCCCTCCTAAGGGACACTGGGAGATTGTGGCAAAGTGG 
CTCTGCAGCCACCGGGAGGGTGTAATGAATAGCCCTATCAC^rCTGCATTCATC 

30 ACATAGCAGGCTAAGCCGCACTGCCATCCTCACCAACCTTGGATGCCTGCCA 
AGGCTGGAAGGGAGACCAGGAAGAAGCACCCATCCTTACTTCATGCCGGAGC 
CTGTGTGTGCAGACACAGTCAAGGTCTCTTCCTCCTGTGTCCTTCATCCTCTTG 
TTAGACCCTTGTTTCAGGATGTCTTGAGAATCTTTCCACGGAGGTTCAGGGGG 
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GACTATGGGGACTTTCAAGTAGTATCAGTTGACTCCTGTATCCTTTCTTACCC 
ATGGGACTGGAGGTAGCTAGGATGACCAAGTTCAAGGCTGTTTGGCCTTGCC 
CACCCGTACAAAGATGGTGGATGATGGGAGGATGTTTTGGTGTTTCTGTCTGT 
GCTTAGTGTCACATAGGACCTCATTCCAGCCCAGGAAAGGCCAGAACTAGAT 
5 CCACATTCCAGACCACCTGAGAGTAGAGAGGAGCCTGACTGCCCCTCCCTGC 
CACCCCTCCCrCCCATCCTTCCACCCTCCCATCCCCCCACCCCCACCCCGCTGG 
CTCATCTATATAATGCAGTAGCCCTCCAGCAGGGCCTTCTACTCTCTTCACAG 
AAGCAGACACAGAGGTCTGGGGAGGATAAAAGACATGCTCACGTCTGTCCTT 
CTACCCTTGAGTTCAAACTCAAGATGTCTTGACCCAGGCTACCTCCAGGTATC 
10 TCAACAGGGCTCACATGGGCACTGCTGGAATTCAGAACTCCAAAGATTGGCC 
CGCAGGCCAGAGTCCAGGAGGAAGCCCTACCCCGCCCAGGGTCAGGCAGGG 
CGTAGAGAATTGCCCTAGTAGCCAGTCTTATGGGGACTGGAGTAAAGGACCC 
TTTCTTTCTCTAGACTAAGCCCTGGTTGTTACTGTCTGTTCTTTGGTTCCTCTGA 
GGCCTCTGCCCATTTTGGACCCTAAGTTTAGGGCTCAGAATGTTAACATAACT 
1 5 TTAGAAC AGGGGGACGGGC AGC AGCTGCTGCAGGCCAACCCAAAGCCAGCC 
CCTGATACCTGCCAGCTGTCTCTACTGAGGCCACCCATGCTGTGACCCAATCC 
TCCAGCCATCCAGACCACCCCTCAGTGGACAGGGAATGTTCTGTCAAAATGCT 
GCTTAGCTGTGTTGCCTCAAGCAGAACTTCTGGACACCCATGCCCTCAAGCTA 
TTATGGAGGTTCAACTCCCTTTTAGGTCCTCTGGGGCATAGATAATCATTTTTC 
20 CTCTCTAATTTTAGGATTGATATGAGCCTTGGGGGATTTTCTTGGGCAGGGAA 
GACAAGCAGAAATTCTGGGCAAAGCATCTGGtTGGTCCAGGTAGAGGGTCCT 
GGCAGCACTCACCTGGGTGGGGTTGTAGAATCTCCAGAAGGCCACCTGCACC 
AGGATGCAGAATGACAGGGCAAAGCACAGGAAACCCTGGGGACAGAGGTGT 
TCTTGTCACTGCATAGCCAGGTAGGTGGCCTGGAGAGGCAGGTGGCTAGCTT 
25 CCCCAGCTGGGCGGGGCTGAGGTGCAGGTTTGAGACCCAGTCCCTTCTGCCTG 
GGGGCAGGAGGCTCTTTCTGGGAATTCCTTCCCCAGGCTCTGTGTGCTTCTGA 
GGATCTGTGACCTAAGGGGCTACACTGCTTTCCTGAGAATGGCAGGCTTTTCC 
AGTGTGGTGGAGACCCTACCCATGGGAGAAACAGTGACCCAGAAAGGGCAG 
GCtrGCAACTCTCCGTATCTGAAATCCTGGCAGCAGAACCTCCTACCCTTTAAA 
30 CATAAATGGGCCAGCATCTATGTTTTATTTTATTTTTAAATTACATGTATTGTG 
TGTGCATGTATAGGGAGGCCTGGGGAGGCCAGAAGAATGATGTTGCACATGG 
AGCCACAGGCTCTATGGATCCTGGGAACTAAACTTGGGTCCTCTGGAAGAGC 
AGGCATCCCrCATTGCTGAGCTGTTTCTCCAGCATTTCTCATATACCTAACCAC 
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CCATCCTTTGAGGTAGAGTCTATGAAGCCCAGGCTGGCCTTGAACTCCATGAT 

CCTCTCACTTCCGTTTTCTCAGTGCTATGATTCCAGGTGTGCACAACCACTGTG 

TTTTATTCAGCGCTTGTGACTGAACAAGGATGTCATGCATGCTAGGCAAACAC 

TCTACCAACTGTCCCACATCCCCAGCCGCAAACTTCTCTCTTGAAGGTAGGGA 

ATAGCCACTTCATATAGGGTACCTTCTGCTAGTTCTTTGTTTTGTTTCTGAAAC 

ACGTTTCCAAGGAGACCCTAAGCTCTATAGCCTGACTCATCCACAACCCCAGC 

CCACAACCCCAGCCCACAACCCCAGCCATTGGTTTGACACCCCAAAGACCGT 

GGCTTTCCTCCTCCTTGTGAGTTTCCTACCCTTCCTCTATACCACTGCATGGGG 

CTCTGTGGTTACCCAACTCCCAACCCTCACCCAGAGTGCAAAAAGGGTCATCT 

TCACACACTCAGCCTTTTCTTTTTTCCTTTAGTTGGGGTACAGACTTCTGACTT 

CTTTACAGAAGCCAGTGGACAATGCAGATGTCTTGCCTGACCATGTACTGCTA 

CAAGTCCCTACACTTTGTGGGTAGTTTCAGAGGGTCATTCTTGTTGAGGAAAC 

CTGTTCCCCAGCAGTGCCCACCAAAGTCCTACCTAGTGACCTCGGATCTGGGT 

ACCCTGGAGCCTCTGCCTGGCGCTCTTGATCGTCAGTCAGTTCCAGGGCCCTG 

CACATAGGTCCACCCACTTCCTGTGTGACCATCGAGCCCTCCATTTCGACCAC 

CTCCCCACTCCAAAGTGTCACTGTCATGTCGcfcAATAAATATGGGCAGCTAT 

GGCCTGGCTGTCTCAACCCAGCACCACTCAGGGGTAGCCCCCAAGTGACCAG 

TATGCAGAATGACCTATAGTCATTGGCCAGGGTGCTAGGTCCATGCCAGGGC 

TAGCTTAGGTTTACAAAGTGTGTGGAAAGAGCTGGGGTCACCAGTACTGGTC 

CTCAGACTTGGCTGGGTGTGTGCTCTGCAGGAATGGGGAATTCCCTTCCTGCC. 

AGTTCCAGAGCCTGTCTTAGAGCTGGGCCTGGGGTTGCCTGCTGCTCTGCTTT 

TCTTTTGCCTTTATTTCCTCCTCCTCTCCCCCCTCCTCCTCTTTCCCTCCTCCTCT 

CCCCCCCTCCTCCTCTTTCCCCTCCTCCTCTTCCGCCCTTCTCCTTCTCTCCTCT 

ACCTTTTCTTCCCTCCITTTCTTCCTCCCTCATCTTTCCTCTTGCTGGCAGTTGA 

GGGCTGTTTTCCCCATCTGGAAGCCATTCCTCAACTCTGCCTCCCGGCTGAGG 

CTGTGAGCCAGCCCAGTCCAGCCCTCTCCAGAAACCACTTCATCACTCAAGAG 

GGAGTCTCCCCCTCTCCCCACCAGCCCCCCCCCCCCCGTGTGTGTGTGTGTGT 

GTGTGTGTGTGTGTGTGTGTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAG 

AGAGAATGTGTGTGTGTAGAAAGGGTCTTAAGGCCCACATAGAGCAGAGAGA 

GGTTTGGCCAGACCTAACTGTGACCATTTTCTCCCTCCTCATTCTAAATGAAA 

ATAAATGAATAAACTCGTAAACAGCTTGAATTGGCTTCCTGCAGCCAGCCGG 

CTCTTTCATTTCCTGTGTTGTGGAAGGCGGGGCAGGAAAGAGCAGGGAGGGA 

GCAGTGGGCACAGAGGTGTTGGGGTTGTTTCCATTCCCTCCCCCCTTTGCCCT 
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TTATATGTACTTGTATGTGTTCTGACACACAGGCAGCACAGATCCATGGGACA 
GACAGATGCAGACAGCGCCCACCATACGCACAGGCACAAAGTGACACAAAG 
AGACTGTAACCACCAGGGGAGCTCCCAGTGACTCTGTGGTGACCGAGCTGGG 
CATACTATCCATGCTCAGGTAGAGGATCATGGGCTTAGCAAAGTGCAGAGCC 

5 AGCACTGGGCCGGGAGTAGCCTTGGGAGCACAGCTCTGTACAGGAGGAGTCT 
AGATGTGCCTTTTAGCCTAACCTGCACAACAGTACCGCACCAGGCCCTGAGTC 
TTGTTAGAGCTGTGGAATTGAAGCTTTGGAAAGCCTGGATGGGTTCTAATGCT 
CCCGAAATCCTTTGCCCACAAATCAAGTGCCAAGCGGGACATGGGAGGAGCT 
TGGGAATGCTGAGATGGGCTAAGCTGCGCAGGAGCTGGGCCTCTTGGCCCAC 

10 CTGTTTACTGCCTTCAGCTTTACGGTCTTCTCGCTGACCTTGGTGCCCCCTGGT 
GGCCGATGTTATACTTGGGCGCAGGCCTCACATGCTGCCTTCTGCGTGCCCCT 
GCCTCCATTTTGTCTGTCCTACATGCTTCCCTCTAGCAAGGCACAAATTCTCTG 
GGATCACAGCCAACCAGCCCACAGTGCCCTTTTCACTAGGGCCTCCATCTGGG 
CTCTAAATCTCGGTTGTTTTTGTTTAGTGCTTATTCATGGCCAGCCTTGTCCCT 

15 GCCCTGCCTGGTGAGGTCTGAGCTGCCAGGTTCTACTTACTCCGTCACCCGGG 
CCCACCCAGGACTGGCTGGTCCCTGCAGCGGCTCTAGGATGTGACTCTGCCCC 
TGGTTCTGGAGACCACGGGTCATCAGCAAATGTACTGAGCACGAAGACAAGG 
TTGCAAAGGGTAGATTGGGTCTACCCAGGAAGCAAAGCTCAGGTTCCTGAGG 
GCTCCTGGATCCCCCCAGTCTCTCACAGGTAGAGTTTGGGAGGTCTGCAAACA 
. 20 GGGGCGCTGGGATGGACGGCGGAGGGGGACAGGAAATGCCTGGAATGGGAG 
GGTAGGTAAGAGAGCAAGGTCTCAGGAAGGAACGGCCCACATTCAGATGACT 
GGGAAAGCCTGGGTGAAACAGGAACAGGGTGGCCATGGGGAGGGCCCTGGG 
AGGGAGAATCTGGATTAGCCATTGAGGGTTGTGGGGGCACACTGAGGCACCC 
ATATGGAGAACTGGATACGGCAGCAACAAGAGCTTTCTTCTTTGGGGTTGTG 

25 AAACGTCTCTTCAATGGCTAGGCAAGGACAGTGGATCTCTGACAAGGAAGCC 
CAGGTTGCAGAAACAGCTTTAGGGACTGCCTGGTCACTTTTGGTCTTCTAATG 
AGAAGTGGAATTCACCCAGTCCTGTCTCCAGACCTGGCTAGGTATCAGAACC 
AAGCCCTTCTGOGGTTTTTAGAGATAACCCCAGGGCAGAGAGGGCCAATTCT 
CTCITrCAGGGGCCCTGCAGCACCCAGGTAGAiKjTTTTTTTTT^ 

30 GACTGGTCCAGGTAGGAGGGTCCTGATTGAGCAAGTGGAAGGTCCTGACTGT 
GTAAGTGGAGGGACTCACTGGTCTATATCCCCtGACCTTTGCTTCTTTCTGGA 
TACTTCTGCTGGGGTCAGGCAGCTCCCTGGTCTTATTCCTCTCCAGCTCTTGTC 
CCCGTCTGAGGTCTGGGAGCCGGGGTCGTCTGGCAGCACATGGAACTTACCG 
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CAGCCATGAGGCCATGGGCCTCCCTCACAGTGGCTATGGTGCTCAGAGCAGC 
TCCCAGGCTCAGGAGCCCCGCTAGGCTGATCCCCACATAGAAGGCTGTGGGA 
AGAGAAAGACAGAGGGTATGGAATCTAGCCTGGGGACTGGGCATAGAGGTT 
GGATCTGAAATCTATCTACTTTGGGAGTCACAGAGCTAGAACTGACCTTCGGG 
5 GAGGTCCCAAAAGGCACATATCTCTGGCTCAAAAGAAGGAATGTCCATAGCC 
CAATAGCCTGTCCTCCTGAGGGTGAGAAGGAAGGGGGAAGAGAAGGTAGCT 
AAGAGACAGTACTCTTTCCCCCATTCCTCCTGTCCTCCCTGCCTCACAGACTAT 

ctTOtAQ ^ eb¥F€eee ^^ 

ATCAGTGCAGGCCCAGGCCCCAGGCTTCCTGCCTAAGGGAGGCCTAGTGCCT 

10 TGGAAGGCAGGGACTGGGGCTACCCATCCCCCATTCAGGGGCCTCAGTTTTCC 
ACTITITCCTACTGGGTGGTGACCTTCCTAGATTC 

AGTGGCCTTGTTCTTGTGGAAGTGAGGCACCCTGGCAGGTTGGGTCCTAGCTG 

CACATTTTCTGCCCTCTGTATATGGTATTCCCTGCCAAGCCCCACAGGCCCAC 

CTGAGTGGGCCCCAGGAGATCCCTGCCCTGAACACATAGATCTGGGCCCTGC 

15 CACAGCTTCACCACTGCTATGGGGCAGGCGAATCAAGGCAGGAGTGCCTGCT 
GTCGCCTAAACCTCCCCACCTTrGTTAGTCCAACCCAGACACTGGGGCTGCCT 
CTCCCrACTCCTGACCrCTGGTTTTTGGCCAAAGCCAGCTCCTCTGGGCAGCC 
TAGGGCTGGGGAGTAAGGTCTGTTCCTCCCAGATGTGTTTGGCCCTGGAGCCC 
TCTCACTTCCACTCCACAGCCATTTCTGGTCTCACTTACCCCAATAGCGCAGT 

20 GTTTCATAGGGGTTCCTCTCCAACGATGCATGGCCAATCACAGTAAAGTGGTC 
TCCAAAGTGAATGACAACCACCATGGTGGCCATAGAGAGGCCCAGCAGCTGA 
GGGTGAAGTGGGCTCACTGTCAGCGCTGGGGAGTTGCAGAGGCCCCTCACCC 
TGACAGAATATAGGGTCTGGGCTTCTTAGGCCCCCAGCCTGGCTCCCTCCAGC 
TTCCTTGGTGCACAGGAACAAGGCAGCTACAGGAGGCCTACATATTTCTGTG 

25 GGCAGGCACTCACAGATAGTGTACTACACACAGAGATAAGTAGAAAACACAC 
TAGCCAGTGCCCCATGGTGCTCAGCTCCTCTCCTCTCCTCTCCTCTCCCCTCCC ' 
TTCACTTCCTCTCTCCTCCCTTCCTCTTTACTCTCCTCCCTTCTGTGGTTCACCA 
GTGAGTGGGGACCTCAGACCACAGCAGACACAGTAGGTATCGCTGTGTCCTC 
CCATCTGAAGACCCAGGAGCTCATATCAAACTGTCTGCAGAGACTCGTCTGC 

30 AGTCCCTGACCTGGCAGTCCCAGTCTCTGGGTCCCAAGCAGCACCCCTTCCGG 
GTAGTCCTTGCAGACTCTTAAGTCACCAGCCAACCTAGGTTTTCCCAGGCTCT 
TCACTGACCCTTGCCTACAAGGACATCAGGTACAGATGTTCACAGTCCTCTAA 
AGGTGGCCACCAGGGACCCAGAACCAGTCTGAGGGACTGTGGTGTGGCAGGA 



WO 02/054069 



116 



PCT/USOl/49808 



GGGAAGACCACAGTGCTGGAAGCCCGGGCATGGTGAACTATTTCCTCTCCTG 
CCTGCCCACCCATCCACTCCTGCCCCACATGCATGTTACCAAGACTAGGAAGT 
TGGTGATCAGTATCTGGCATTTGGCAATCTTGATTCGATTCCAGTGCCCCATG 
GTGCTCAGCTCCTCTCCTCTCTGCCCTCGTCTCCTCTCCTCTCTTCTCCCCTCCC 
5 TTCACCTCCeCTCTCCTCCCTTCCTCTTTACrCTCCTCCCTTCTGTGGTTCCC^ 
GTGGGTGGGGACCTCAGACCACAGCAGACAeAGTAGGTATGATTGTGTCCTC 
CTATCTGTCAGCAACCCTTGTAACCTAGTGGCTGTGTTGGGGGAAGGAAGAA 

GGGCAGGeeGGAAGGGGGGGA^H^GTA^ 

CCAAGATAACTGGATTTTTGTAGGCATGAGGCCTCCCTTTGGGTAGGCTGGGC 

10 TACACTGCAGCATGGGGAGGGGTGGAGTCTTGAGGGGAGCAGAAAGAGAAG 
GACCTATGGTTATAAGGCATGGCATTGGTTGTAAGGGTGTACTAACCTGTAAC 
TGTCCAGCTACGTAAAAACCTTTAGTGTTTGGTTAGACCTCATGTCCCAAAGT 
CCTCCTGGGACTAATCCCCAAACAGGACTTCTGCAAGTGGCAAATGTGTCTCA 
TGACCGACCTGATGGCATTAACCAATTCATAACTGGCACAGAGGGGAAGGGT 

15 GTGATAGAGTCAAATCTCAGGAAATGTCCAGGGGTTGGGCAAACAGGCATAG 
GCGTTTCAAGATGGAGGAAGTCTGTTTGTATTGACCTTCGGTGAACTAGCCAT 
ATTTGATTGAAAAACAAAAGCAGAGCATGCTTCCCACAGCCAGCACCTGGGA 
GGCTGAGGCAGGAAGGTATTAGCAAGTCAGAGGCTAACCTAACACACTCAGT 
GAGACCCCCACCTCAAAATACACAGGCTGGGGACAGGGCTCAGTTGAAGTGT 

20 TTGCAGTGCAAGCATGAGGACCTGAGGTCGGATACCCCAGAAGCCATGTTAA 
AGAAGTCAGGCATGGCAGCGCATGCTTGTAATCCCAGTGCCATGGAGGCTGA 
GGCAGGACGATGCTGCCCGGACTCACTGGCCAGCCAGCCAGTGACACCTGAG 
GTTGACCTCTGGCTTCCACGTGCCTCCACCTACACCCTGCCCCAGGAACACAT 
ACATAAATCAATACACGATAAGAAAGAGCCATCTGAAACTCTGTCAGCCACT 
25 TTTTGAAAAGGAGCAAGTGTTTGCTTACTTCAAGAACTGGAGCCCCCCCAAG 
AGGGTCGGTGCCCCACTCCCATAGGCTCTCTGTCAAAATGTGCGTCTGCAATA 
ACACACAGGATGATGTAAGCCACGGAATGGGCAGTACTGCAAATTCAAATAT 
ATGCAGGCCATGCTAAGTGGGCTTTGGAGAGTATAACATGAGTGTTCACCTTA 

aaacctgg:tatgtctaagcctgtccagatatctgaggctgtttctgggtacc 

30 CAGAAAAGGATGCATGGCCCTCTTTTCACACCCTGTAATGTAGGGTTCTCCTA 

gtcagggggcatcttggtaggaaaaagaggtaactgtggaagtgatccctgg 

GCCTTCTAGAAGCCAGTGAAATAGGGCTGTGTTCTGGGCTGTCAATGTTGCTG 

ctgggaagattgtcaagtggccacccattcaggaagactgacattggagcct 



WO 02/054069 



117 



PCT/US01/49808 



CCTGCTGTAGTCCCCTCTTTTCTCCTTCTTCCTAAGGGGCTGTTCCAGCACTCC 

AGGTGTCTCCTCTGCAGCCTGCATTTCCCAGAGTCTTAGTTGTCAGCCTGTAG 

GGTAGGTGAGGGGTGACCACTATATCACTCAGGGAAGACTCCAAGACCCAGC 

CAGGGTGCAGAGCAGCTTAGGTGCAGCCAAGTCTCAGGGTGTGGCCTAAGCT 

TCATGTGTGCCAGGCCTCCTGCAGCAGAGCCTCCTGCTTGCTCAATCCTTAAC 

CACCTTTCTCTACCTGCATGGAAGAAAGAGTCAGGCAGCCTCCAGGACCCAA 

GATGGGCCTGGAACCACCTAGAACCCCCTCCTCAATTACTCCGTGCCTAGGAC 

-?^<jAGTGGAG€€Ai3GCQCTTTC 

GGTAGGGACCCTAGGCTTGTCCCTCCAATACTAGGTCTGTCAACCCTCTCTTC 

CCCGAGGTCACCAGTGGGGCCAGGAATCCATGTGTTGTGGGACCACATAGCA 

ATGTGCCCTCTTGGATACCTTTGCCTTCCACCCTTCTGCTCTGACACTCTCTGT 

AGGAGGAAATGGTATGTCAGAGGAGCTGCTGGCCTGCCTCCTGCCTACCTCA 

GGGCGTTGGGCTATGGCrCATTCCCAGTAGAGGGCAGTGCTGGCTCTCTGAA 

GTCAGGGAGGGGCCCAAAGCTTGGGACATGATTTTGCCAGCAGCAGATAGTT 

TCTGTGACTGTGTGATGTTTGGAATTCTTGGGACATTTCAAAGGGTATAAATG 

CTAGGGCTCCGAGAGGTTGGGTGGGGTGGTGATTGATTGCTGTTGGTCATGGT 

TTGTTGAGTAGCCCTGCGGAAAGAAGAGAAAAGAAAGAAGAAGGAAGAAGG 

AAGGGGGAAGAAGGAAGAAGGAAGAAGGAAGAGGGGGAGGGGGAGGGGGG 

AGAAGAAAAGGAGAAGGAGGAGGAGAAGGAAGAGGAGGAGGAATTTATTA 

TTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATATATCCTGA 

TGGCAAAGATCAAACTTGCCCCAAGGAACCAGATGCCCCTATTTCAGCAGGA 

AGCAGTCTAATGATAACACTACCCCCTTTCCTTTCTATCCITrTTCTCTCTCCT 

ACCTAGTGTTAGGGGTTTGAAAGGGTGAAAGAAAAGGGGTGGAGAAGGGTA 

GAAGAAAGAGGAACCTACAAAGTAGCCAAAGTCCAGCTACAAGTGGCACTC 

AGATGTAGGCCTAGGCAGAATGGGAAATTTGATTTCATGGCAGGGGACAAAG 

GAATGGGTATTTTATTCAAGACACCAGCAAAAAATTTATTGTGGCTGCTACCA 

CTGGGATATACCTTCCTTTGCTCCCAGAGAGGGTTTTCTGGTTTTTCTTCTTCT 

TCTTATATTCTGTTTCAAAAAAGGTTGGGAGAATGGTTAAGTGGTTGGAAAAA 

TTTTGGGTAGAAGTGGAGGGTCCrrGAGAAACAAACAGTGTGAGGAAAGGGA 

AAACAAAAGGTGTCACTCTGCTCTCTGACGTCTGCAAGGGGTGCAGAGTTGG 

CTGCAGCTGGAAAACTTGCAGGTGGAGGTGGAAGCGCCGGGGAGGCGAGGA 

ACACCAGAAAAGGGAGAACAAGAAGACTCAGTCCTGGGGACTGAGCAAGAA 

AGCAGGAGTTTCCTGCAGGAGGAAAGAGGGATAGAAACTGTGCCAGAGGGC 
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AGTCGGGTCAGAGAAAAGCTGGCAAGGAGGGAGAAAAGCTGAATGGCTGGT 
GAAAACAGTCCATTTCAAAAGAAAGGGTTAATCTCAAACTAACTGCGCTGCC 
AGAGTAGGAAGAGAAGTGGCTTGGGGCTGGGAGATCACCATCAGCTTTCCCA 
ATAGTCTACAGCATATGCCTGCTAATGGTGTGCAACAGGGTTATGACAAGAT 

5 AGGATGGTATCCTATGGAAATGATGGTTCGAAGAGGCCATGGTCTCATATGG 
GAGACACAAGCCTCACATGAAACAACTGGGCCACTCAAAATGATTGCCTAAG 
ACTGAAAGGAAATGGTTACAGTTGTACTAGAAGCTGGTCTGTAATTGCAATA 
GATAACATGATGAAAGGAAGAAGCTTTGAATATTAAACAGGGAAACATAGTG 
ACGGGGAGGGATATAGTCAAAGAACAGTTGCTAGGCAAAGGGCCCTGCTCAG 

10 ATATGCAAGAACAGAGTCAATTTGATGATGCTACAGTGGAACAATGTTGCTT 
AGTGCCTTTAAGAGCTTGGGACAAAGTTGGAGAACCTGGAGAAAGGTCCATC 
TTGTTTACAAAGATACAAACATTCACTTTAAAAAAAAAAAAAACAAAAAAAA 
ACAAAGATTAGTCTCAGCTGTGAATGAAGCTGTATCAGACCCCGATGAAAGA 
CAAGTATTGATAGAGACCTTGGTGTTTGAAAATGGGAATACTGAATGTAAAG 

15 TGCTATTAGACCATTAAAGGCATGTGCAGTGCTTATGCATGAGTGAATAAGG 
GATACAACCGGTATCAGTTCTAATATGTACCATGTAATATCATAGGTCAAACT 
ATAGCTAGAGATCTCTGATATCAAAATTCCTAGTACTTCAATTGCAGGAAATA 
AAGTCATTTCCAAAGAGACTGTGACCAAGCCACTAAAGAGCTCAAATCTCAA 
AATGCTCAGTGCTTTAATTGTGGGAATCGTAGTCATTTGCAACAAAATTGTGA 

20 ATAAGGCATCTCTAAGGGCAACAATTTTCCTAAATATAAACCAGAAAGAAGG 
CCTAGGCTTCCAGGAGTGTTTAAGCAACGTGGCAAGGGTTGCCATTGGACCA 
GTGAGTGTAAGTCCAAGAGAGACAGTCAAGGCAATTTTTTGCCATTGGGAAA 
TGAAATGAGGGGAGCTGGCTCGAGGCCCCCACAGCAAAAAGTACAAGCTGTT 
TTGAACAGCAGCTGAGAGACACTGGGTTATAGAAAAAACTGCAGAATTAAAT 

25 CAGTCTATATACTATAAGGATGTGTTAACATCAGAATGGAAGCCTACAAAAG 
TTTTACAATGGGGATGCAGGTTTGTTTACATTTCCACGGAAATGAAAAGCTGT 
GGATATTATCCAAACTGATAAAGATTAGAATTGAGCAGATTAGATTTTCAGA 
ACACCTTGGCCACTGATCAGAAAACACTTATTTAATTGAAAATGTCTGTTTAT 
AATTTCCAGCTACATCCAGCACAATTAATGACGTAAAAiAGGGGGGAAATGT 

30 AGTGACAATTTTTGAGTTTTGGTTTCTTTATAAAACACAGAAATATTATTAGA 
ACATTTTTAATTCAATCCCAGATGTAGGGTATGGGGCTGCTTCAGATTGTCCA 
CAGCAGCTGTCTATGATTTGCCTCGTGCTCTAGCAGGGGCATGATTTTACCAG 
TGGCAGATAGTTCTGTGATTGTGTAATGTTGGAATTCTAAAAACTTTTCAGAA 
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GGTATATAAATGCTAAGGCTTGAGAGGCAGGGTGGGTTGTTGGTTGTTGGTTG 
TTTGTTCTTGGTCAAGGTTTAAGTAGTGGTGCACAAAGAAGAAAGAAGAAGA 
AATTAGATATTCTGATGGGAAAGATCAAACTTGCTCCAATGAATTAGATGCTT 
CTAACCAGTAGGAAGTAATTTAATGGTAATGTTGTTCCCTTTCCTTTCTATCCT 
TTTTTTCTCTCCTAGCTAGTGTTAGGGGTGGAAAGGGTGGAAAAGGGCTGGA 
GAAGGGTGGAAGATTGTGTGTATGTGTGCGTGTGAGCATGTGTACCTATGCAT 
GTGTGTTTATCTGTGTGTATCATCTTCATGTATGTGTGTGTATCTATGTATGTT 
TCTGTGTTTGTATATACATGTATTTGTGTGTATAGGTTTATGTGTGTACGTGTG 
TATTTGTGTTGTGTGTATTTTTGTTTTTCTATGTATATATGTGTGCTTGTGTATA 
TTTATGTATATATGTTTATTTGTGTGTTAGTGTGTGTGTACATTTGTGTGTATTT 
ATATGTGTGTTATATATGTTTGTGTATATCTGTGTGTCGTTTTGTATATCATGC 
TTGTGTTTCTTTGTATGTGTGTTTCTATGTATATGTGCTATGTTTGAACGTTAT 
ATATTTGTGTGTGTGTGTTGTATTTGTATGTGTCTGTGTGTTATGTGTATGTGT 
CCATGTGTGTATATTTGTTTATGTGTGTGTTTGTGTATATATCTATATCTGTGT 
GTGTGCCTGTGTGTGTATTTGTTTATGTGTGTATGTGTCCATGTGTGTATGTGT 
CCAGTTTATGTATATGTTTGTGCATATCTGTGTGTGTGTGTGTGTGTGTGTGTG 
TGCGCGCGTGTGCGCATGTGTGTATGCACTGCACTCACAACAGTGTACTTACT 
GCTGGAGTACAACATCCTACTTTCTCTCGCCTTCCTTCATCTAATAATGAGTTA 
GTTGACTAAAAATGGGTTGATCCAAAAGATAGCCAACAGAACATTTTATGAA 
TAGTGTTAGATTTCAATTCACCATATTAAAAAAAAAATCAATGACCTGAGAG 
ATGACTCAGTGGCAAAGGCAGATGCCACCAAGCCTGATGACCTGAGCTCAAT 
TCTTGGAGCTACATGGTAGAAGGAGGGGATTTCTGCAAGCTGTCCCCTAACA 
GCACCTCCATATGTGTGCTGTGGCCCAGCCCAACCCCACTGTGGATCCATGCC 
AGCACAGGGAAATGATAGGCTATATCAGTGAGTGAGGAAGAAGAGAACAGA 
TCTCCCATCCGGAGGAACCCCAGAGAAACCATGCTAGGGCCCCACCCTCCTG 
GAATTAGGGCAGATGTCAGCACTCATAGGCTCTTTGTGTATTGGACAGTGCAG 
ACAGTCCAGCCATAACACTTAGGGACAAATCCTGCAGATGGTGGGGACCTGT 
GGTGGGAGGTAAGGAGTGAGGTATCTTCCATAACTTATGACCCCCATCTAAT 
- ACAAAAAAAGGTGGGGGACATCTGCCAGCTCGGGCCCTGCAGGACACCTGAA; 
CAGCTCCCTCAGAACCCATGGTCACATCAAGAACAGTAAGAAGTCTTTCTAGT 
CCAGAAGATGCAGGTCATGTAGGATCCTGGATGGACCTTGGGGCCAGACAAG 
GATAATCAGGGACACAGAAGATGGGTATAGTTGACAGCAATACATTAATACT 
GGGTGACAGCTTCCCCTCATCAATGTTACATGTTGCTGTGAAATGTAGTTTCC 
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TGGCCCAAGTTCTTACCATATGTGCCCAACCTGTGGTCTGTATGCGACTGAGG 
ACAGCTAGCTATACATATGGCTGAACATAAAATTGTGAGTATACTTAATACAT 
TGAGAGTTTTTCAGTTTTGTGCATGCGCGCGTGCGTGTGTGTGTGTGTGTGTGT 
GTGTGTGCACGCGGTCAAAGGTCAACATTAGGTGTCTTTCTTGATCACTCTGC 

5 ACCCTCTTTTCTCTTTTTTAGATTTTTAAAATTTTTATGTGTATGGGCTTCT 
AACACCACATATAAGTGCCTGAAGAGATCTTCCAGAACTAGAGCTACAGTTG 
GCAGTGTGCCTTTATAAGCATTCTAGAAACCAAGCCTGGGTCCTTGAGACGA 
GCAQC^^Tr;r.TrTTGCAGCT GGTGCTCTTGCACCGGTGCTCTTGACTGCT GA 
GTCATCTCTCCAGTCCTTCACACCTAGTTTTTAAGATGACTTCTGGGTTCAAAC 

10 TCAGGTCCTCATTATGCATAACTCACTTTACTAACTAAGCCATCTTTCCAGCCC 
CATGGTGGAGGGTGTCTTTTGGTCTGTTTGTTTTATAATTCAATTGTAAGGCTG 
CCACACATGTGGfGACAAGATTGTGCCATGATGAACATACCTAAAACCCCTTT 
GACCCTCCAGAGTGGCAGGAATGTCATTCATGCATTAGCTAGTGGTTCCCCAG 
GCACCCAGGTTGCCTCAGGCTGTGGCTGTGCACCAGGAAGTTGAAGATAGGT 

15 TGGGAGGTTTGGGAATGTCAGTCCTGCCCCTTCCCCAGCCTCTAGGGAGGGG 
AGAGCAACTGGAGGCCAGTGATGGCATCAGCCTGACTGTGCATTAAACCAGT 
TAAAAGGACAGTGTAGGAATATCCAAACAGCTGACCATGGGAGGTGCTGAGG 
GCAGTGGGACACCTCCTCCCACATTCCTGTTCTTGAGGCACACTGGTGCCGTA 
GCTGTAGTCACTCTTCATCATGGCTTTTATAGTAAAGTAGTGGGCAGATATAA 

20 GCCAGAGCTCTCCTGAGTTCTATGGGCTATGCCAACAAGTGACAGACATGAG 
GAGGATCCTAAAGGAACCCTAACACATAGGTTACAATCTGAAGCTATGATTG 
GCACCTGCAGAGGGCTGGGGTGTGCACTCTTATTGGGCGGAGCTTTCTGCCTC 
AGGATCTGATGGCATCTTTGAGCAGGCTGTGCAGAATTGAATGGAATTAGGG 
CACCCAGCCAGCTTGTGCTGGAAATGCTTGGTGTGTGGGGTCATAGCTGGGTC 

25 CTATGCTAAGAGCATTGGAAGGACATAGTTATTATTTTTTTTATGATCCCAGC 
TGTAATAGAGGCTCTGAGGGGGGTCTTTGTGAACTCTAGCACACCTCAGCATC 
CCCATAAACTCCAAAGAGGTTTCATTTCAAAGCATGTTTTTGTCTGTCCCCAG 
GTTGTCTCGCTGAGCCACATCCTCTTGGCCTTACCCCCCTACTTCTCCTTCCCC 
CTGCTCCTTCTGCAirCTTAGAGGGGAGAGGACAACACCTGCTACCTGTCCACC 

30 CACCCACCACCAGCTTGTCCTCACCCTCCCCTCATGCAGTTGTCCATCTGTTCT 
TGTGTAGCTTTCACTTCACTGGTGCCTCAGTTTACCTACACAATTGCCTCCAGG 
AACCCTGGTGGAGATAGAGACACTTTGGCCTAAGAGCAGGCTCCTGCAGTTT 
GGAAGTGGTTACTCTCCTCTTTTGTCCTCTTAAGAGCCACCTGGATGGAGAAC 
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CAGCAGAGTGGGACCAGCTTAAACAGCCCCTGACATCTGAGGGCCTGGCAAG 
GCAGGGTGGATGGCACACAGCTCTTCCCACTCAGGAAGGCCAGCCTGTCTGT 

tacctgggcacatagagacacctgcatccaagttctcttcttgtcctgtcact 

gcctggagagtggggctccaagctcgaggggactcttaggaaacctgaactg 

gaaatggcctgacttttgcctttgttctgattctgaaattgtccactataatgc 

aagtgagaggagccgttaggagctgccttctaccccccacctgagcccaggt 

gtcacttgctctccagctctggtatgtggcagctgagcctattctcagttcac 

actcacagctgctccagtagaggacatcaaggcttactctgctccacccaccg 

ctgtcccttctgggagtgaggccctcccttcctatcagtgggacccagcttgc 

cctgcaggctgcagctgccctccctgggggtgtccagggtccataaccccctc 

tcaggctccaggaacacctggaggctcacctctctacctccacactgtcccat 

tctttagtgtaaggaaactagaagcttccaggactaggcctaggttagcaga 

tggggacagtaagccttggagatgacaattacatcaggagaataaagattac 

atttacagggcagtcattagagtggcctagcttgtagatacctggacctggct 

aggtagtcatgccatctgaaggactggcactgaagtctcagcagacagcacc 

agtgaggagcttcaggcccagcccacgggcagcactcacagaaggcctaact 

ctctcaaggcccagttcagtatcccagatggaggaagggtcctggtgtgctat 

ccaatctcacccttttcctctttctcccagcctcccaaagggacctgcagctac 

tcacctggtgcctttggtttgggaaagagaaacttttaactttggggatgctt 

ctggaccatttgttggtactcactgcctctggagacagaggcacctctgtgac 

taatggctgtggtaggggctggcagggtcaagtgatgcatggaatttcagtt 

tggatctgactcctagtgaccagtgaattctctagacccccttttggctaatt 

cctagtcctaggtccagcagtagaatagaaggcatagctactcttggagcct 

caggtttgttttccaacctggacittgatggtgagaaagaacatggggaagc 

cactagaaccacctccactgagagcacataaaccaaagctacaaagaaagat 

tttagggtagtgagtctaccatcaaaggcttgaggtgggtaaccctagggaa 

gtgggtaaccaaccttagggtggtgtgtaactaacctcagggaagtgggtaa 

ccaaccttagagtggtgtgtaactaacctcagggaagtgggtaaccaatctt 

agggtggtgtgtaactaacctcagggtgfitgggtaaccaatcttagggtggt 

gtgtaactaacctcagggtggtaggtaactttacctgtgagtgtaactgttgg 

cccaagtcctgggattgccggtacaaatcaatacacatctctagacctagtaa 

acagccactaggttggtagaatcaatcaatcaatcaatcaatcaatcaatca 

atcaatcaatctctatgtgtgtgtgtgtgtgtgttgcgtgtgtgtgttgtgtgt 
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GTGTGTTTGTGTGTGTTGTGTTCTAGGTCCCCCCAGCACCAGGTGTTGCTAGT 
GTCCCTGACTCTGGCCATTCTCATAGGCAATGATGTCTCCTTATTGTTTAAACT 
GCAGCTCTCCAATTCTGTGAGATGGTGAGAGTCTGCTTGTCTGATAGCTGCAG 
GTCTTCCTTGGTGAAGTATCTACATCTTTTAAAT.GACTTGTTTTATTATTCATA 
. 5 TTTAGGGGTTCCTTGTATATTTTGGATAGGGGACCTTTAATATCTAAAATATA 
TTTGAGGGTGCCAGAGGGATGTATCAATGGTTAAGAATACTTACTGCTCTTGT 
AGAGGACCCAGCTTTTGTTTCCAGCACCCATCTGGTGACTCATCGTCATCAGT 
AACTCCGGCGCCATGGAATCGAGTGCCATCTTCTGGCCTCTGCAGGCGCCAGG 
CATGCACATGGCACACATACATGCACATAAAGCACTAATACATAAAAATAAA 

10 TAAATCTTTTAAAAATTTTTTGCAAAGATA • 
CTTGTCCTCTTCCTCTTCTTTTAAAAGAAT^ATTTATTTATTTATGTCTATGTG 
AGTGTTTGCACACTTGTGTGTAGGTACACATATCAATACCTCCATGGAAAGGC 
CAGAAGAGTGTATTAGGTATCATTATATATCACTCTTCACCCATGCCTTTGAA 
GCAGGCTTTCTCTGCTAGATTGGCAACTAACAAGTCCCAGTGATCCTTGTCTG 

15 AGCCCCCTCAGAGCTGAGGTCACAGGTGTACAAGGAACTCCCAGATTACTAT 
GTGGATGCTGCGATCTGAACTCTGGTCTTCACGATTGTAGAGCAAGTCCTCTT 
AATGACTGGGCCATATCTCCAGCTCCCCAAAATCCTGACCCCCTCTCTTTGAT 
ACATATCTCATGTATCCCGGGCTAGCCTTGAGTTCACTGCATGATCTTGAGCT 
TCTCCTGACACCATCTCCCAAGACAGGTATACATTATCACGTCTAATTCCCTG 

20 TCATTCTTTTAGCAGCATTTCTGCACAGCAGATACATGGTTTAGGCATGAAAT 
CCCTCCTGCTGGCTCATGTATTTGAGGACCTAGTTCCAACCTGGGGGTATTGT 
TTTGGGAGGTTCCAGAACCTTTAGGAAGTAAGTCCTCACCGGAGGAAGTAGG 
GCACAGTGGGGTGGACATAGAATGTCATAGCATGGCCACACTTTCTAACCTA 
TCTCTGCTTCCTGATCCATCCAGATGTGAACAGGCTGTCTCATGTTTCTGCTGC 

25 AACGGTTATGAGCTGCTCCGATTGCTATGACTGACTGGGAGCCAACACAAAT 
CGTCCTTCCCTACTACAGTTGTTTCTATCAGGTATTTTGTCAAAGCAACAGGT 
AAGGTGACTAAAACAACAGAAGTCTTCAATTTTAATAAAGTCCAATCTCTTGG 
TTATTGACCTGGTGAGTACACTGGGTTTCTGTGTACATCTTATTAAAACTTACC 
AGGACATTTAAAGCTGCTATTTTCAACTAGCAAGAAGCACACC4TTCCGTCCT 

30 ACCTCGGGCTTGGAAATACCTCATTAGGCTCCAGCCCTAGGGGTCTCTATTAA 
CTTCTTTTCCTGGAGGCACCACGCTGCTCTGCTGCTCCATAGACAAAGCTCTG 
CTTCCTGGGCTCTCTGAGCAGGAAGCTGTAGGCATCTTCCCTAGTTTTATTTAT 
GCAGCTGTGATTGAAAGCCTGACAAAAAAGCAGGGAGAAAAGAGTTTATTTT 
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AGCTTACAATTCCAGGTCACAGTCCCGTCATTGTGGGGGAAGTCAAGGCAGG 
GACTTTAACAGCTAGTTACATCTCATAGCCAAGAGCAGAGAAAAATGAATGC 
ATGCATACTTGCTTGCTTGTGCCCAGCTGTATGCCTCTGTTACCATACACTTCA 
GAAGCCCCTCCTGCAGGTGATGGTGCTACCCATGGTGGGCTGGGTCTTCCCAC 

5 AATTAATTAAGACAGTCCCCCAAAGACATGCCATGGGCCAATCCAAAGTACA 
AAAATCCCTCACTGAGACTTTCTTCCCAGTCAACAACTCTAGGGAGTATCAAG 
TTGACATTTAAAGCTGATTTTCATAATATGTTAAGATGGCTTGGTCACATGGT 
ATGTGTTTGAAGGTCAGGTGATCAAATCCAACAGGATTTGGGGGAGGTTTCT 
GCCTTGATGATATTGTTAGTGTACATTCGACTGGGTCAAGTACATGGTCTTTC 

1 0 CTACTCTGT AAG ATAC AGC AC AGTGTCTAGTGGCTCTCTTTGG ATTTCCCAGT 
TAGCAGATACATATTTAATATCTCAAAGGGTTATTATCAATTTTCAGTGAGGC 
CCAAATTAGAAGAGGTTCTGCAACAGGCCCAGGCTGCTGTGTAACCTTCTTAG 
TGGCATGAGCGGCTCTGAGCAAAGTGCTTATGGGAGGTAGGACGTTCTCTTG 
GGGCTGTTCGGGCTCCTGAAAGGACTTCACAGTCTACCACCCTTGGTCTGTGA 

15 GGCTGAGCCCTGCCATCCTCTATGGATGCTTCTTCAAATCTTGGATTACCTCTA 

ggctccacagagcatgtggccacgtgacaccaactgaccacatgacctgaac 
tGtccattataaaagagccattgtctgacctgtgatccattgagattggggaa 
acacaggtgtgtgtcaccattttgtagaggcagtggatgcaaggcaagcctg 
ataactaacaaccccgtgttagttacrgcactcattcctgggacaagatattt 

20 cgcaaaagcaacttacaggagggtttactctgattcaccatttgagagtaga 
gtccatcgtggtgaggactgaagtgtggtgggagtacgaggcagctagcagc 
actcagctaagaagcagagcaagatgggttcrggtgcttccctttgcttgctt 
tttttcatttcagtgcccctgcccagggaatggtgctacccacatttagggtg 
aatctgttgatctaatctgaaaaaaaaaatcacctttcagacttgccctgaga 

25 tttgtttctgtggtgattctaaatcccagtaggttggtaatcaagattaacta 
ctgcatactacaaagctccatccagcttttttgaagagctcccacgatgctgg 
tagctgttcctgcttctttcccagtgcctccaggcagcacctgcacttcatatg 
caatttcctattactcggtcactgatagagaacaagtcttggatccgcttcca 
attc'rtttggatccgcttctaattgctttgcttgtcatgcgggtgacctgcagc 

30 ctcctatgaggaatggaagcaccgaagcagacaggacttggagcaatcatcc 
tggttattctacttgacctagaggagagataaggtttacaagttccatgcttt 
ggtggccattaggtcatgatgatcagggaattggaaggaaagcatttggggg 
ggggaatggtggtacgaggaggttagagcaaaggtttatctcttccacagtg 
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CACAGCACAGAGGGGAAAGGACTATATCTCTCATGCAGTGCACAGCACAAGG 
TGTGAAGAGACCTATGTCCCATAAGGATGTTGGGCACAGAAATAAACCCATG 
GGCTGAAGGGAGCTCTATGGTGGATGAGCCCCAGATATCACTGTCACTGCCA 
AAAGGGCTTGAGGAGGGCATGACCTGGAGATCACATGTGGACTGGATGGTGC 
5 ATGGTTGTCCATTCATTGAGGTTCATCTGCTAGGGTTCCTGCTTGGCACTCACT 
GATGGGTGATATTGGAGTGCTGTCACCATGAGAGAGACTCTTCTGCTGTCACT 
GACACCAAGGCATACCTGAATGTGAGCTGGCCTTTCAGGTCCAGGGTGCCTTC 
TTAGTATACTTGACACTCTGCTGTGTATCTTCTGGTTTTCTTCTGAACAGGGAC 
TCAGCACAGGCAGAGAGTGCAAGCTCAGAGAATCGCTGTCCTTATCATTCTCT 
10 ATCACCCCAGCCTGAGGACAGAAGACTGCATTTTCTGCACCCCAAGGAGCTG 
GAACAATGTCCTCTGAATCCATCAATGCAATCGGGTTCCTGTAGCCACTGCTC 
ATGTTCCTGGAAACACAGAGGTTTCCCCATCACCCATCAATGATTATTTTTGT 
TCTTCATGGAAAATCCACCAGTGACTGGCAGATGGAACCCAGGGCTTCTGCA 
TACATTAGGCAAGCACCCTGCTACCAAAGTTATATCGCATCTCTATGCCAATA 
1 5 ACATTTATCITrCTGTTCTCACAACTCTAGGTTCTCTTGGTTAAAACTTCTACC 
ATAGGCACTACACTGACTCCTGGTACCCTGGAGGTTGAGGAGACCATGGGTC 
CACTTTGAGGTACTCCTGTAAGAGAACCACCAGGATGCAAAGGGATGATTCC 
AGGCTGGTACAGCTAGAAACAGGCTGTGGTTTCACAATAGAATGCCTAGAGT 
GTACATGATACCCTGCAGCTCTGGTCCTGTGATTAACATCAGTGGAAGAACTC 
20 CACTGCCCAATCCAGGCAGGACTGCCGAGGGCGCAGACTCTTCAGGAATGAA 
GGCTTAGGTCACCCACCAGGTAAAGGGCCACAGTCGGCTAAAGCACTCGCTG 
AAGCACAAAGACTTTCAGGAAGATGGTTATAAATACCAGCCAGGCCATGTGA 
CCAGCTGCAGCCCTGAGCCTATAAGATTTACAGAATTCTTCCCTATGCTCTGA 
GCAAACAGGGCTGAGTGATTCTTCCTTCCCCTCCTGCAGACTCCTGCACCTAG 
25 TATTTATGAGTTCCTAGAGTTGCTAATTTTCTCTCTCTGGATTTGCTGCAGCTC 
AGCTAGAAGGGGAATGGATAAGGACACAGGACCTCCCATCTCTGTGGGGGTG 
TGGACGTGCATCTGGCTATGCAGTGTGGACGTGCATCTGGCTATGCAGTGTGG 
ACGTGCATCTGGCTATGCAGTGTGGACGTGCATCTGGCTATGCAGTGTGGACG 
TGCATCTGGCTATGCAGTCCbAGCTGTCTGTCAGGTGAAAGTGGAACCTTACT 
30 TGGTTTTATTCAGAAATCTCACCAAAGGGTTTCATTTTTGTAAGGAGCTCAAA 
ACAGGCAGACCCTTAGAGGCAGAAAAACGGCTTCCTGGTGTTGGAGAGGGAC 
TACCAGGAAGAGTGGGTGCTAATGGCAGTAGGGAAATAGGATCTGGTAAGCT 
AGGAACATTCACACACCAGATAGTGGTGGTGGTCTGGTGGAGTTACTGAAAA 
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CCACAGATGGGTGTGTTTACTCAGAGCAAATCAGATCCCATGCCAATCAAAT 
GCAATGGAGTTTCTTGTTTTCAAAGGGAAGTTATCTGTGCATTCCAGGGGGAG 
GGGAATCTGCATGATGTCACCTTCACAGAACTGAGGGTGGCAGCCATGGGTG 
TGGAGGTGAAGTGGGGACCCTCTTTATGCCTCACATTTGTTTCTGCCTGGCTC 
5 CCTAGAGCTTCTCCCTGAAGTCTTGGGTGCTGGCCTCCTACCGTGCTATTCAC 
AGGTTGGTCTGTTCCAGATTTTCTGGGAGCCCAGCAATATAGAGCCTGACCTC 
CTGCTTCCCACTTCTCGGCCACTGGGACTCCAGATGGCTACTGTGCCAGCCAG 
GACATGTTCCAGGAGCCACCAGCACCCCCCTCCCATAAAGGGTTCTTCTGTCT 
TGGTCCTAGTCCTCCCTATAAAGGGCCCGGGGAAGTGGGGGGGGGGGGTATC 

10 TGTCCCCAGCTCTGGTTACCAGTACGTGACCATTCTTGGTCTTATGCCTGCTCT 
CCTCTTTCTTTTCCTCAAGTGCAGACTACAGCAGAAGCCCGAGATCATCAGAG 
ACCCCATCCAGTCCCCAGCAACCTGTACAGACACTCTGTGCCCTGTGGTAAAG 
CACACAGTTGAATTGTTCATCTTTGCTGTGACTCAATGATAGGTTGGCGGTAT 
TCACTATCTGTTGCGCATCCTCCAAACAGAAACTCTGTCTCTGAAAAAAATGG 

15 ATCCTCCGTCCOTCTCAGCCCCCATCATCCGACCCTGCTTTCTATCTCTGTAG 
ACCTGAAGTCTCCTGGGAACTCTCAGGAGAGGAATCCTATACCCTATGTATCT 
GTGGTAGGCTTCTTTCTCTGGGCAGTGTCTAGAATGTTCTGTGCTCCCTCCATA . 
CCTCTTCCCCAAGCCTGTGCAGGCCATCACTTCTGCTCACCTTCCCATAGCAG 
TCTTCTTCCATGCTTCCTGGTTCATCCCTCCCTCATCTAATCTAGCTACACAGC. 

20 CAGTAGGATGACCATCTTITACTGGTAGTGGTGGGCAGCATGGTGGCTTTACC 
TAGACTGTGGTCCCATTCCCCAACCTCTAAAGACAGGGAGGAAAAGCCAGAA 
ACAACCAAATTCTAAAGACTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 
ATGTACATGTGTGTAGGCATTTATAGATGTGTATATGGTTTTATGTATGTGTG 
CATGCCTGGCATGTGTGTATGTGCCCAGGCACATGTATATTTATGTATGTGTG 

25 CAAACCTGTCTGTGTGTGGTATGTGTGGTGTGTGTGTGTGTGTGTGTGTGTGT 
AATATGCCCCTGGATCATAGTCACTTGTGTTCACCTCAGTCTAGGACAAGTGT 
CCAACCCTGTCTTGGCCCATTTTTTCTCACTATGTTAAAACTTCTGAATC^ 
AAAAGTGGAAGAAAAGAGATTAAGGGCATGACAAATTGCATCAGTTTGGCTT 
TGGTGAAGATGGTGTCACGATGGCTGTAGGTGAGXTGGGAAGTGAGAGAGTG 

30 GGGAGGGGTCAGAGTTGTTGTGTGTGTGTGTGAGCAAATATGGACAGGATTA 
TGTGTTTTAGGGGTATGTTTGTGCACGTGCCTTTGCAGAGGTCAGAGATCAAG 
CTCTGCTGTCTGTCATTCCATAGGAGCCAGTCACCTTCTGGTTTGAGATAGGG 
TGTCTTGCTGAAACCCACAAGTTGGAGGAGATAAGGCTAAATTGACTGTGCA 
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GTGAGCCCTGGGGCCCCCACCTGTCTCCCCAGCGCATTGATATCAAGCCTGAT 
ACACTGGATCTGGTTTTCTAGTGAGCATCAGGAATAGAACTTAGACCACATGC 
TTGTGGGGCAAGCATT^ACCTGGGCTAGCGCATCTTGTTGTTTTGCTTTGTTG 
TTTGTTGGTTTTGTTTTGTTTTTTGGTGGTTGTTTTACAACAATGTTCTTTGAGA 
5 AGCCAGCTGGTATCCTTGACGTCTACTTAATCCCTCCCAAGAACATTGTCCTA 
GAGATACTAGCTCTGAATCTTTTTTTTTT^ 

GGGTTTCTCTGTGTATCCCTGGCTGTCCTGGAACTCACTCTGTAGACCAGGTT 
GGCCTCAAACTCCGAAATCTGCCTGCCTCTGCCTCCCAAGTGCTGGGATTAAA 
GTGGTGTGTGCCACCACTGCCCCGGCACTAGCTCTGAATCTTAAAGACCCCAG 
10 GACCTCTGACAGAGCCACAATGAAGACCAAGCCTCAAACACAGAAACTTTTG 
AGAGGAATTTAAGCTACACTCAAACCACAGAACCCCAAACCCTTCCTCCTAT 
AAGAGGCTTTTACTGGGGGTCAAGGAGTCAGCAGAGCTGTTGAAGGCGGCAT 
TTGACCTGAGAGACTCCTGAGGACTTTTGCTGCATTATGATAAATGTTAGTTT 
TGTAATGACACAATTTCCAGGTCATGTATACAGGTGGGGATGAACATTCTGCC 
15 AGTGTCTCAAAGGCACTGCGGGGATCTGCAGCACACTGGCTCTGCCGCTCTTT 
CCTTCCTCTCCTCTCTCTTTCCCTTCCCTCTCCTCCTCTCTTCTCTCCCAATCTCT 
CTGTCCATCCTCTTTTCTCCTCCTTTTTCTCTCTTCCCTTCTCTTCTCTCCCTCCT 
TCCTCCTCCTCTTTCCCTCCACCCCCACCTCTGCTCTCTCCTTCTCCTCTCCCTT 
CTTGTCTCTGCTTTCCTCTCCTTTTCTTTTCCCTCCTCTTCCTCGCTCTTCCCCCT 
20 TTCTCCTCCCCCCTCCTCCTCTCCTTTCCTCCCCACTTCCACTTCAGCCTCCTCC 
CTTCCCCCCAGGTATCTCCCACCTCTGGAGCCCTCTCAAGGGAGCCTCTCAGG 
TCCCTGGACGCTTCGGGCCTTGCTGGCTTCCTGCTGCTGCGGCTGCCGGGAGG 
CTATTGTTCCACCTGGGTGAGATCACGTCCTGTTTTCAGGTCTTGGAAGTGGA 
CAGGCACGTGTGGTCTGCCCATTTCATCACTCTCCAAAAGGAATCACAGGGA 
25 GGACTGAGTGTTGGATAAAAGTTGTAGCCAGGAGCTTTCTTCACTGCAGTGCT 
GCCTTTCCCCTGGACATCCACCCACTGGGAAGCCCTGAGCCACCTGAGGGCCC 
CAATACAACCGCATTTTCACCTATTTGGGCTACTTCAGTATTATGGTATTGCTA 
AGTGAAGGCTTTCTAACCCTATCCCTCCCTGTACGTTTATTAGCTGACACTCTG 
CTGCAAAACAAACAAAAACAAAAAACCACCCAGCCTTTCCTTCTCCATGaCA 
30 AAGTAAGCAATAGACTACCTTTAGAATGGACCCACTGTCAACTGCTGTGATGT 

catttaccctgaagctaaggacaatccctgtccccagtgtgtccagcaggcgg 
• ctgtccctcagatctagctcctttgtcttgttgacagtgccctgcacctgactc 
actcactttctgagatgtcatttcaggtatttcttgcctgctggtggaattagt 
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ATCACTGTAAGGAACTCTCCCTTTTTATCAGAAAATGATGATTTTTAGGGGAC 
CGTGAGATGGCTCAGCATGTAAAGAGACCTATCACCAGCCCTGATGACCTGA 
GGTCAACACCTGGAACTCATAAATGGAAAACGAGAAGTACATCCTGAAAGTT 
GTTCTCTGACCTCTCCGAACACACATCACACACACACACACACACACACACA 
5 CACACACACACACACACACAGGCGCGAGAGCACCTGCACACAGATTTTAAGG 
TGTGCTGTTTGACCATCAACAACCCAGGCTTTTCCAAATGACGACTGGTAACC 
TATGCATATCCTCATTTACATGCTTGCATCTTTATCCAGCTAAATGCTGACGG 
ATCCCAAGATAAAAACAAACTTAAGTTAATCCTGAGCTCTGTGATTCCGGCCC 
AGTCGCTGGTGAGTGTTCTAAGCTCTACCCTGACATTGCCTGTCATTTATCTCT 
10 GTGATAAGACATGGGGCTCCCTCATCCTCAGTGTGCCCAACCATGTCTCCATT 
TTCCTGGCCATGCTGTGGGTACTTGCCCCTGACTATATTCTTGGACCCAGCAG 
ACTGTAGCTTCTTCTCAGCCCTGACCCCCAGCCCTGTTCCCTGTGTCCCTAGCT 
CTTTAGCATTCACACAACTTTAAAGAAAAGAAAGTGAAGTGGAGAAAAGCAG 
TAAGAATGTAAGTATGTTGTAACTATGGGAAAAAAGGAAGTGGGAGAAGAT 
15 GAACCCCAGTTCCACTCCCCAGTCAGGAGGACCTGCATCCTAGCTTGTTCCAT 
GCACATGGCCAGGCAGAGTGCTCAAAGTTGACCAGGGACCGGAGCTCCCAGC 
TCCATGACTGAGAAAGGGAGCAACCTTCTAACCCCCCCCACACTGTACCAGG 
TAGCATCACTGACGGGGACAGAGCCACGCGGTGGTCCCAGTGGAGACTTATC 
ATGGAAGACCATCATCAGAAGTAGCATTAAGATGGCCATTAGAGAAACCAAG 
20 GGGATCTAGTTGTAGAGAAGATGGCCCACTGACAACCTGATGTGGGGGGCTG 
TGAAGAGGTGGGTGAGGAGGAGGAGTGGTCAGGTGGTAGTGTTCCAGAGCA 
GGAGTTCTCAGCCTCCCTCATCCTGTGTCCCTTTAATACAGTTCCTCAATGCTG 
TGGTGGCCCCCAACCATAAAATTATGTCCTTGCTACTTTATAACTGAAAGTTT 
ACTACTATTAGGAATCGTAATGTGAATATCTGATATGCAACCCCTGTGGTAGT 
25 CGTGACACAGGTTAAGAACATGGCTCTAGAAGCTCAGGAAGAATATAGGCTG 
GGGCTGGAGCTGGATATGGGAGAAAAGCAGGCCAGGTGCGGGAGCTCATGA 
AAGAAAGAGGGAAACCAGCAGGAAGGAGATACCTGGAGAAGTCAGACACCC 
CAGGATTCAGGCAGCTCAGGGAAGTACCTTTGGGAGGACAAGAGAGGACAC 
ACATGCTAGA^iTGCTACATTGTCAGAGTTCTAAACTGGTCCCAGGAAACACA 
30 CTCTGCACCTCGAGAATAATGCTGGGGCGAGCAGGAAGGCTCAGCGGGTTAA 
GGTGCTTTTGCTGAACCTGAGAGCCAGGGTTGAGCCTTGGGACCCATATGATG 
AGAGAGAACCAACTTCTGTAAGTTATCCTCTGATCTCCACTGTAATCTCTGAC 
ACTGTAGCACTTGAGCATGGGCTTGAGCATGTACCTCTCACCATACAAATACA 
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TGCACACACACACACACACACACACACACACACACACACACACACAGTGCTA 
GATGTGGGAAAATAAGTTCTATAGTTCTGGTGTATGTGCATGCGTGCATGCTG 
GTATACATGCATGTGCATGTATGTGCATGTATGTGTGTGTGTGTGTGTATCTG 
TGAATGTGGAAGCTGAGGTCCATGAGTTTCTCITTGATGGCTTTCCTCTTCATT 

5 GTTTGAAATGGGGTCTCTCACTGAACTTGGGGTTCACCAATCTQGCAAGGCTG 
GCTGGCCAGCAGGCCCTAGGGGTCCTCCCGTTTCAGCCTCCCCAGCCCGGGG 
ATTGCACCACATCTGGCTTTTACATGGATCCTGGACATCAAACTCAGGTCCTC 
GTGCTTGTGTAAGCACTTTGCTGACTGGGCACTCTCCCCAGTCCATACGTGAG 
TTTTTTATAGGCTGTAACTATGATTTAACTGCAGATGGAATAAAAAATAAGTT 

10 CAGGCACACTTAGCTAGGGCATGCTACAGAGATCTGCTTCAGAAAGAGCCAC ; 
TGTACCAGTCCTTTAAAAGATGTATTCAAGCAACCAGAGAAGTGAGTCCTGG 
AAGGTGATGGAGGCATCCAGAAGGAGGCAACCTGGAGAAGATGCCATTTTCA 
CAGTATAAAGAACCTGGAAGTTTGTAGAAATCATGGGTTTTTTCTTTATGCTG 
ATGAGTGTTTTGCCTGCATGTATATATTATTTATTTTATGTTAATGAGTGTTTT 

1 5 GCCTGC ATGT ATTTATATGTAC ACCATGTATGTGCAGTGCCCTGGGAGGCCAG 
AAGAGGGGTTAGATCTCCTGGGACTGGAATTACAGCTGGTGGTGAACCACCG 
GGTAGGTCCTCTGTAAGAGGAGCCAGTTCTCTTAGTTGCTGAGCTTTCTCTCC 
AGCCCCAATCAAGTTCATTTTTAAATGCAAAGTAACAGTTGCACTCCAAGGAC 
AAACCACACTCTATAGGTAGATTTGGATTGGGTAAGTTTTCTATTAGAATTCT 

20 TACATCCACAGCTACGAGGAACAAAAGACTATGGTTTATCTGGTTTTGGGATC 
TGAGCCATGCTGGCGTTATTAAGACAATTAGGATGGCTGTATCTGTAGGCATA 
CTTTTGGGGGCTTCTGTATTGGGTTCTTATACCTCTACCATCCTTCCAACCCTT 
ATTCTACCCTAATCCCTTCCAACCCTCCAACACTAGGTAGGAGAGAAAGGTTA 
GAGAGGAATGGGGGTGTAGACTTCTTTAGACTACTTCCTGCTGATTAGGGCAC 

25 TGAGTTCCTTGAGGCTAGTCCAGTCTTCATCGTCAGGCTATCCTATTTCTTCTT 
CTTGTCCTTTTGCTCTTGACCACTTTACAACCAGTGCAGCAGCAGGGGAGCAG 
CAGCTTCCCCTGGCCCTCTCGGGGCTCTCACATTTATACCCTCTCAAGAGTCC 
CCAGAATTCCAGTCATAAGCTATCTGCAGCTGGCAAAAATCATGGCCCTTCTA 
GAGTAGAGTATGAGACCAATCATAGfTGTCAGCTGGGAGCAATCTGAAGCAG 

'a 

30 TCCCATATCCCACATCTGGGATTAAAAAAACATATTdACATACCATAACTGGG 
TTTTTTTTTTTAAAAAGAAACCATAATTCTCACTACAGCATCCTTTTCAAGG^ 
TTCTGGATGGGTCGGCTTTATTTTTTGCCTTCTGTTGTTGTGATAAACACCGTG 
GCCAAAAGCAACTTGGGAAGGACAGGGTTTATTTCATTTTACAGTTTACAATC 
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CGTCACAGAGAAGACAGGTCAGGACTCAAGTTAAGAACCTGAAGCAGGAAC 

TGGGGCAGAGGCCATGGAAGAATGCTGCTTACTGGCTTGCTCCATATGGTTCA 

CTCAAACTGCCTTCTTTTTCTTTTCTTTTTTAAGTTGGATGGTGGATATATTATT 

TCATTTTGTGGACACATGGGAATTTTCACAATAAAAATTTAAAAAGAAGTGC 

AGTTCTAGAAGAGCTTCTTTATGAAGATTACACCTTTGTTTCACTTGCCCTTCA 

AACTCTTTTCGTAGTTGATTGTCCCAAGCCAAGGGTTGGGCTTTAAACAATGA 

AACCATATAAGCAAAATCTGAGTGTGAAATGCCCCCCACAGGATCATATATT 

TTTAACCTTTGCCCCCCCCCCCCCCCACTGAGAAGGGTGTAAGACTTTTAGGA 

GAGGGAGCCCTGCTGGAGTTAGCGGATCACTGGAGATGGGCCTTGAGGCTTT 

ATAGCCAATCTCACTTCCTGTTCCCTTTAGCTTTCTGACCGTGGATACAGTGCT 

ATTAACCAACTTCTTTCCTGCTTGCTTCCATGTCGTCCCCATCACGATGAACCT 

TCTCTGAACTGTGAGCCCAAATAAACACTTCTCCTTTAGCTTGCATCATTGAG 

GGATTTTATCATATGGTAACAGAAAATGAAGCAAGACATAGGATGTCTAAAA 

TITGGTGCTGTTAGTATCTGTGACTGAATAATTATTGGTGCCATTGTTTGTTTG 

TTTGTTTTAATTTTGAGGGAGGTCTGTCCTTCAACCTGCTTCTTTATACAACAC 

AAACTCCTGCTCAGGGGTGGTACCTCATACAGAGGGCTGTGCCCTCCCACATC 

AATCACTAATTAAACAAGTGCCCCACAGGCAGGAGTAGGGGAGCACACTTTT 

ATTCCCAGCACTGGGGAAGCAGAGGCAGGCTAGTGTCGGTGAGTTCAAGGCC 

AGCCTGGTCTACATAATTATACACTGTTTCAAAAAACAAAACAAGCAAAAAA 

GGAAAGAAGGGAGAGGGACAAGGGAGAGGGAGAAGGACCAGGGGGAGGGG 

GAGAGGCCACAGATGTGCCAGCAGGCAGCCTGGTGAGGATGTTTTCTCAGTT 

GACTCTAGTTTGTGTCAAGTTGATAAAAAAAAAATCTAACTGCGTGGCCACA 

AATTTAATTTACTGAGTAGATATGGGACTATCATGTTTATCTGCTTCTTATGAA 

AATTCTGCTAGTTTGTGTCTTTCAGGAAATATTCCTATTTCAGCTAAGTTGATA 

CATTAATGGCACAAAGTTAGATGTGTGTTTCTGATCCATGGGCCTGTGGTTCT 

GTTCTCACTCACTCCTGCTGTTGAGGCCTAGTGAGTTTCCTCTCCTCTCCTCTC 

CTCTCCTCACCTCTCCTCTCCTCTCCTCTCCTCTCCTCTCCCCTCCCCTCCCCTC 

CCCTCCCTTTCCTTTCCTTTTCTTTTTTCCCTTGATCAATATGGCTm 

CATATTTAACTTCTTAAAGCCAGTGCTTGGCTTTCCTAGCn7vCAAGTTCAGAG 

GACACAGCCTGTCATGGAGTAATGGCAGGAGTGACTTTCAGCTGTTTTCCTGG 

GGTGTGATGGGGAGGGTTCCATATCTTTTATAATGAACAAGCAAAAAGAAAG 

AAGGACTAAGTTATCAATCTCAAGATCTACCTCTCAGAAACCCACTCCCTCCA 

gctagtccctatatctcaaaggttctacaacctccctgaaccgtgccaacagc 
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TGGGGACCAAATGTTTCCATACGTGAGTCTGTGAAGGGACATTTCACCTTTAA 
GCCATACATTCTACCTCTGGGCTCCAAGGCTCATTCACGGCCGTCTCACAATA 
TAAAATGAACTATGTTTCTAAAGTCCCTACAGTTTTGTTGTCCAAAGTATAGC 
ATCACCTCTGACATACAAGGCAATGTCTTAACTGTGAAGCCCTGTAAAATAAG 

5 ATAAAAAATAAGTTTCACACTGTTAAACAATAATATCTTGAAGCCAAATGGC 
ACGGGCAGGCAGAGGACATCCAGAGGCTGCTATGATGTGCATGTATGTGGTG ' 
GACAGATGAGAGAGAAAGAGTAGCAGAACCATTTGAGCACTATGAAGAGAG 
ATGGAACTGGAGACAAGAGGGTGAAGAGGAATAGCCTGTTGTGAGTAACCTG 
CCCTGCCACCTGGGGTCTTGGTGAAGTTCCGGCCTATACTGCTACTGAGGGCC 

10 ATGTCTGGGTCCATGGCTGTGAGGCAGTATGGTTCTATGTCTACGTCCGTGGC 
TCACATTACCACTAATGACCATGCAGACATCCCTGGTCTGGGCTGCTTCCTGG 
GACCATGTTGATGTCCAAGGCTGGGCAAAATAAGCAGCCTTGACCCTACTCTT 
GTCCAGTGTTCTCATTGTGAAATAAGAGTGCTCTTATTATAAAATAAGGATCA 
CTATTTCTCATATATACTTCCAACACAGAATGGCACAGAGTAAACCTCCGAAT 

15 TCCAAAGGGAGGGATAGGGACCAAGCAAGAAGAGACTGAAGTCCAACCTGA 
AAAACAACCATGTAGCTCCATGTTCAGCATTTAGGACTAATGATAAAACCAT 
CTGAGGTCCAACAGCCATGGGTGGCCCTGCnTCCCCAGeCCCTCTCACTTTGC 
CTGACGCCATTCAGTACTTGCAGCTTTCCTTAGTGGAGCCCCCATGGTTCTGG 
TATATCTAACCTCCTGAAGCCTGTAATGCAATCTAGGCTTCCTCTTTACAGCTT 

20 CACATAATAGTCTCTAAAGTACTTCTTGAAGAGAATCCAATCCTGCCACACAT 
AGTTGGTGGTTTTCTTGAACCTTGGGGCAAATGTCTGTGACCTAATAATTTTTC 
AATTTTGCATGTTTGCAAAACCAGCTCTATGAGATCACAACTGCCAAGTTCTG 
TTGCCAGCTCTACATGCAGCTTGATAGCCTTGGACCACACCTGCTGTGACCTC 
TGTGTGTCTGTCCAGCATAATCCTGGAAAAACACTTCCTAGATAGTTAAAACA 

25 AAGAGGGGACACTGTGGCCACCTTTCTATTCAAGCATTCTCTTTCCAGTACAT 
TTACATTTTATACATTGTAGGTCGATGGGTGCGGTCTTGCCTTCAATATATCTT 
TTCTATTGTCCAAGTACAGAGTTCTGGCTTCTCTTTTACTGAATTAATTTCTTT 
AGTAATTAGTGCTGCTTTGTCCATAGCTGCCTTTATCTGCAGCCATAAATTCAT 
- fCTTGCTCCTTTCATTCGATGCTACACATTTTCATACCTTTCTACCCTGATTTTT - . 

30 ACTTTCTCTCACTGTAAATCTTAGTAAATACAGAAAAAATAACCACTTTGTtA 
TCTTTAAATGTCCTCTGCCAAACAACTTAGTCTATTATTATTATTATTATTATT 
ATTTTCATTTTTTCAAGATGAGGTTTCTCTATGTAGCCCTGCCTGTCCTGAAAC 
TCCTTCAGTAGACCAGACTGGCCTCAAACTCACAGAGATCCACTTGCCTCTGC 
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CTCCTGAGTGCTAGGATTAAAGGAGTACAGCACCATCGCCCAGCTATTCTATT 
ATTTTTCAATTCAGCCATATTCAAGTACTAAGAATAAAGACAGTCCTTGTCAA 
AATGTAACACAAATGGTCTTTAGCTCAATTCCTAATGGAGTCCTTATTCTCTG 
AAACCACATGCACACAGTCTTCGGTGTCTAAGTTATTATGGGCATTCAGGTCT 
TCAGGGCTCCCAACTCGAGTTGCCCAGTGAGTTCTTCTTACAGTATTCTAGAG 
CTTTGTTAGCTCACAACTACAAACTCAACCATGTCCACCTCTAAACCCAATCC 
CTAAAGCCTGTGCAGTACATGGCCAGGTTTAACACAGAAACTACTTCTGCCCC 
ACTTCTGGTACCAGTATTCTGTATTAGTTACTTTTTGTTGCTTTGACAGAATAC 
CTAATAGAAGCAACTTAAAGGAGAATTGTTTATTTGGAGTTATAATTCTGGGG 
ATACAATCCATCATAAGAGGAATGGAATGCAGAGGGAGGGCTATGTGGTGGT 
CCTTTACTTCTTACCAGTTAAGAAGCAGAGATAGGACAGGAAGTGAACAGAG 
CTAGAATATACAACTCAAGACTTGACCTCCAGTGACCCACTTGTGCCAACTGA 
ACTGACTTCACCTCTCAAAGGGCACAAAACATCCCAAAACACACTCTCTGGG 
GAATTAAGTGTTCAATACATTTCACATTCAAACCATAGCATCTTATCCATTTT 
AAGGTGCAGAGTTCAAGGGAAAACCAAACAAACAAATTCACATTGCTGTACA 
GTTATCACTGCCATCTGATTCCAGAGTGTTCCTACCTTCCAAAATTACGGTCC 
ATTAAGCCTGAAGTACAATTCCTTCTTTCAGCCCCTGCCCCACCACATCTATT . 
GTGATGGTTAATCTTGACTGTCAACTGGACAGGGTATAGAATCACTTAGGAA 
ACACTCCCTATCTGTCTGTGGGTCCATGATGTTTCTAGAGATGCTTAACTCAA. 
GAGGGAAGATCTACCCTGAATATGGTAACACCTTCCTAAGTCTGGAGTTCCTC 
ATAGACTAAAAAGAAAGGAAGCCAAGTGGCAGCATCCATCCTTCTGCCTCTT 
GACTTAGATACAATGTGACCAGGTGCTTCAGGATCCTGCTTGCTAAGTCTTCA 
CCACTGCAATGAGCTGTGCTCCTGGCTGTGCCCTTGTGCTGTGAGCCAGACAA 
ACGCTTCCTTCCTTAAGTTGCTTCTTGTCAGACACTTGGGCATAACAGTAAGG 
25- CAATTATCTGGTACTCATCCCACCTTATGTGTCCTTGAATATGATGGCTCTAG 
GGACCTCATAGAGACTGAGTCACATGGTACCTTCCCCCTTTATTTTTGTAGAA 
TTTATirrTGTAGAAAAAACGCAAAATAAAAATAAGTAAATAAAATriATrm 
TGTGTTCTATCATCTATCTATCTATCTATCTATCTATCTATCTATCTATCCTTCC 
TTCCATCTATCTATTI'.TACTCCATCCATCCATCGATCCATCCATCCATCCATC 
30 CATCCATCTACCTATCTATTTTTACTTGGGCCTGTCAGTCCCCATGGAACTAGA 
CTTGTACATGAGGGTGGGCTATTTGAGCGGGTGCTGGGTGGGAGTCAAACTT 
GGGTGAAGAGCAAGTGCTCTTAACCACTGAGTCATCTCTCTAGTGCTTGTCCT 
TTTTGTGATTAGCTTTTTGCATTGAGTATTGTGTCCTAAAAGCCTATCAATTTC 
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CCACATTTTTCCTCTCTCTTCTTTCTTTCTTCTGCCTCACTATGTAGACTAGGCT 
AGTCTCAAACTTGTGATCCTCTGCCCTGGCCTCCTATGTGTTGGATTTTAGGTA 
TGCATCACCACATGTGGCTAGATGCCTGCCCTCTCTCCCTCTCTCCCTCCCTTC 
CTTCCTATCTGTCTTTTCTTGAAAAAAGAACTAACATATCTCAGGCTGGCCTT 
5 GAACTCATTAAGTAGCCAAGGATGACCTTCTGCTTCCATCTTCTGAGTGCTGA 
GATGACAGGCATGAGTACTGTGCCGGTTTATGGAGTGTCAAGGACTGAACCC 
AGGGCTTAGTGCATGCTAGGTAAACATGCGTCAGACCGAGTCACATCTCCAG 
ACCTTCTTCACCTTTAAAAATTATACTGATTGGTTGGTTAGGTATGTGTGTATA 
TGTGTACCACAACACACATGTCAAGGTCAGAGGACAACTAATTGATTATCTTC 
10 TTCCACTATATGGGTCCCTGGTATCTAACTCAGATTGTCAGGCTTGGTGGCAA 
GCACGATTTACCTACTGAGCCATCTCAATGACCCTACACCATTTTAAAAGGGA 
GTTATTTCTTTCTATATGTATGGGGGGTTTGCCTCTGTATGTCTGTGCCCCATG 
CCCATGAAGTGCCCACAGAGACCAGAGAGAGCATAGAGTCTCCTGTAACTGG 
AATTACAGATAGTTGGGATCTATCAAGTGGAGGTTAGGAACCTAACTCAGGT 
15 CTTCTGCAAGAGTAGCAGTCACGGAGCCATCTCTTCAGTCCCACTCTCTCCTT 
CATTTTGAAGAATAGGCCAAACTACATTCTGCTTATCCATTCACCCACCAGTG 
GATGTAGGCTTGCTCTGCATATGCCTATTTTAAGAAATGCTGCTGTGGACAGA 
TGTATGAGTATTGTGTCATCTCCTCTTAGTGTCTCTGCATACAAATGATTGTTT. 
CTGCATGTATGCATATGTATCATTTTCCTCTGACTAAGAATGTTTTTTCCTTTC 
20 CTGGTTTCACATAATGCGATTGCGATGTATCTTCTTTGTTCTCAG^ATTTTGCTG 
AGCTCTCTGGATCTGTGAGCTTATGATTTTTATCTAGTTTGGAAATTTTTCAGC 
TCTTTTCCTCTCCAGTTGTATTTTCTGCCTAATTTCTTGTTCTCATTCAGGAGTG 
TGTTTGGCACCATGCTTCTCCTCACCGTGCCTGTCTGTGCTGACCTTTGGATGC 
CACCCACTTCTATGCCTTCAAATTCATTAACCTTTCTTTTCACAGCATCCAATT 
25 TGCCACTGGTGCATTTTCCATTCCAGATGCTGGCGTTCTTGTTCATCTCCAGAA 
CTTTAGGGCTTGTCTTCGCTCATGTAAATACCTCCTGTGCCGCCACTTCACCTT 
CAGCCTGTGAAACCCAGTCACAGTAATATTTTTGCTGACCTTGTCTATGACTG 
GTGATACTGTTCACCCCACATAGTTTCITrTGTCACAGAAATGAAAAAAGTTA 
CTGAGAC AGT ':■ GGGTATT AGAC ACTGC AACGT AAATCTTAAAAATATAT ATAT 
30 TTTAAGATTTATAGGATGTGTGTGTGTGTGTGTGTGTGTGTTTGTGTGTGTACA 
CATGTACATATGGATGCCCACAGAGTGCAGAAGAAGGCATTGGATCCCCTAG 
AGCTAGAGTGTTAGGAATTTGTAAGCCACTTTACATAGGTATGGGGAACTGA 
CCTCTGGTCCTCTTGCAATACTAATAACTGTTCTTAACTACTAAGCTATCTCTC 
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TAGCTCCTTTTTCTTTTTAACm^ 

GTGTGTGTGTGTGTGCGCATATTTGTGTTTGGACATTTAAGTTCGTGTATCTAT 

GTGTGTATATGTTTGTGTGTGTGTATATGTATGTAGATCTTTATATATGTGTGT 

GCATGTTTGTATGTATATGCACATATGTGTGTTTGTAGCATGTATATGTGTGTA 

TTTATGTATGTTTGTGTGTGTTCATGTGCACATAAAAGAAGCATGTGCACACA 

GAAGCAAGTGCATGTTTGTGCATATATGTGTATATGTTTGCATATAGGTATAT 

GTGTGTATATATTTGTATGTGCACACATGTTTGTGTGTGTGTGCGTGTGCGCA 

CGCACATGCACAGAGACCAGAAATCAAACTCAGGTGTATTTCTCTGAAAGTG 

AGGCTTTTGAGCATAGTCTCTCTGGGACCTAGGGCCTGCAAGCTCCAGGGATC 

CACCTGTCTCTGTTTCCCCAGCACTGAGGGTTCTGGAGATCTTGCCTGCATGG 

CAAGCACTTTACTGACTGAGCCCTCCTTCAAGTCCTTCTTCCTTGTTTCTCTTT 

GACTTCAGAGTGAGCTACATGTCACCTGGATCTTCCTTTCTGCCTTTATTGCAT 

CTAGCCTGTGCTTGGGACACTTGCTTTCCAGGATGTCTTGCCATCGGTAAGGA 

CACCCTAGAATATTTGCTGTTGTATATATGGCAGCTTGGTCCAATGTCTGATG 

ATTTGGTAAAGGTGTCTTCTACTAGCAGGCCAAGTGGCTTCTTCCCAGTGGCA 

CTGTCCACTTCCAGGGCATGGCTTGGAACCAAGCTGTCTCTCCAAGGTGTCAG 

TGAGGGTTTCAGGGTGTCATAATGGTAGGCCATCTATGCTGAGCCAGCTGATC 

CfCTGCCTGTCATTCACCAGTAAACTTGGGGAGTGTCCCCTTTTGTTCCATGA 

GACCCAGGCATGTTATAGAATGTAGACTCCATTGTTGAGTGCCTCAGAGCCAC 

CTGATAGAGGTTGTTGAAAGATGGCTGGGTCTGCGCCATTCCCAAGGGGATG 

GAATGGACAAGGGAGCCACACAGGCAGATGGCTCTCAGAACAGCATGTGGA 

AAGCTGAGACTGAATGCCAAGGGCTGGGTCTCATCTGACTGGGTTTGTCCAG 

GTTGTCCCTACTTGGAGATGGAGCTCAGAGAGTGGGAGGCAAACTCCATTCT 

CAGGAATCCAGAGTAGACGTGGGGCATCCTCCAAGGGCAGAGTGGAAAfCTA 

GCTGGAGAAGAGGTCTGGCCACAGATGGACAACTAGAGGGAAAGAAGACAG 

ATGGTACTGATGTCATGAGTATGGAGACTTCCCTGTGAAGGAAACATGGGAG 

TGACATGGGCTGAGGGAAAAGGAGGACAAGATCAGCCTGCCATGAATTAAGT 

TTGGGAAGCTGTGGAGCATTGAGGTTCAGTCCACATGTCACATGGATAAAAA 

CTGTGGCTTGTACCAGAAGCTTTGCC|rGCGTTAGTTCTGTCCCAAAGCTAAAG 

GGKjTAGGGGAGATGGTTCAGTGGTTCTATTTTTTTTTTTTTT^ 

TTGTGACAGGGTTTCTCTGTATAGCCCTGGCTATCCTGGAACTCACTTTGTAG 

ACCAGGCTGGCTTTGAACTCAGAAATCCGCCTGCCTCTGCCTCCCAAGTGCTG 

GGATTAAAGGCATGTGCCACCACCGCCCGGCAAGAACACTAGTTCTTAACGG 
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CTCTTCCAGAGGACCCAGTTTCACTTGTTGCTCTTCCAGTATACACAGCTTCA 
GTTCCCAGCACCCATGTAGTAGCTCACAACCACTCCAGTTCCAGAGAACCTGA 
GTCTCTCTTCTTGCCTCTGTGGATACTGCACATATATACATGCAAGCAAACCA 
CCCACACTACAACATAAAATAAATATTGGATTTTAAAATTTTAAACCATTAAA 

5 GCAGTTAAAAAAGGTAAAAATTCAGGCTGAGATAGCTAAGTTGGTCATGTGC 
TGGCTGTGCATGAGAACCTGGCTTTGAACTCCAGAGTCTACTTCTAAAAGCCA 
GGGATGCTAGCTTGTAATCCCATCACTGGGAAGACAGAGTCAAAGGATCCCT 
GGCACCAGCTGGCTAGTGAGCCCCTGATACAAGCCAGTAAGAATCCCTGTCT 
TAAAGTTTGGAGCTGAGACGAAAGGATGGACCATGTAGAGACTGCCATATCC 

10 AGGGATCCACCCCATAATGAGCTTCCAAACGCTGACACCATTGCATGCACTA ] 
GCAGGATTTTATCGAAAGGACCCAGATGTAGCTGTCTCTTGTGAGACTATGCC 
GGGGCCTAGCAAACACAGAAGTGGATGCTCACAGTCAGCTATTGGATGGATC 
ATAGGGCTCCCAATGGAGGAGCTAGAGAAAGTACCCAAGGAGCTAAAGGGA 
TCTGCAACCCTATAGGTGGAACAACATTATGAACTAACCAGTACCCCGGAGC 

15 TCTTGACTCTAGCTGCATATGTATCAAAAGATGGCCTAGTCAGCCATCACTGG 
AAAGAGAGGCCCATTGGACTTGCAAACTGTATATGCCCCAGTACAGGGGAAC 
GCCAGGGCCAAAAAAAAATGGGAATGAGTGGGTAGGGAAGTGGGGGGGAGG 
GTATGGGGGACTTTTGGGATAGCATTGGAAATGTAATTGAGGAAAATATGTA 
ATAAAAAATAAAAATAAATTAAAAAAAATCCCTGTCTTAAAACAAAACAAAA 

20 CAAAACAAACAAAAAACCCAAACAACAACAAAAGGATAAATGGCACAACAC 
CCAAGGTTGTCCTCTGGCACATATTCTACACACAGTGCATGTATGCACCGATA 
CACCCTCACACACATACTCTTATGCACACAGTATACATATGCACCCACACATC 
CTCACACACACTCTTCTACACACAGCGTACATATGCACTCACACAACCTCATA 
TGCATACTCTTATGCACACAGTATAGATATGCATGCACACACCCTCACACACA 

25 CTGTTCTACACACAATGCACATATGCACCCACACACACTCACATACATACTCT 
TATGCACACAGTATACATATGCATCCACACACCCTCACACACAGTCTTCTACA 
CACAGTTCACGTATGCACCACACACACTCACACACATACTCTTATGCACACAG 
TATACATATGCACCCACATCCATACAAACATGTATACACACACACACACACA 
CACACACACATCAACATTAAAATAACATTTTGTAGTACAT|.AACAGGACCAG 

30 AAGTTCCAGACAGTGTCTATACTTCTTTTGCTGGGAAACATTTATGGTGTTTTT 
GTAATGTCTGTGGCAGCATTTGCTCCCTAGCATGTGACACTGACTGTATAGCC 
TGCAAAGCATGACAGATTTGCTAACAGGTCCTCAGAAAAAGCTGTTTTGCCCC 
ATGCTTGAAAGCTCAAGGATGGAGTCACACAGCCAGGGATAAATGACCCCTG 
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-CAAAGGAGAGGTGCCAAGGAGCAGGCAGGGCACAAGCCATGTCAAAATCTT 
CCAGCTGATTTTTGAGACATAATCATGCACCACACACTTCATTGAACCATTAA 
ACCACAGCCATGTAAACTGTAGCATAGTCAGCTTGTGTGCATGTATGTGCCAT 
AGCCAATTGTCCAGATTGTGTGGAGATCAGATTCTCCAGAATATCAGTCCTTG 
5 CCTGCTACCTTTCTGAGACAAGGTCTCTCTGTTGTTGTTGTGTATTCTAAGCTA 
ACTGACCCTTGAGCTTACAGTGAGTGTCGTCTCCGTCTCCCATTGTGTCATAG 
CAATGCTGAGGTTACAGGAATGTGTTCCGATGCCAGGCTTTACATGGGTTCTG 
GGGATTCAACTTCAGGCCACTGTGCTTGCATGGCAAGCACTTTATCTGCTGAG 
CCATTTCCCCAAGCTCCTTAGAACATTGTGATTAAAAGGAAATTCATACCCTG 
10 TAGCCAGTACCTCCCCCTCATTCTTGTCCCCGTGTCTATTTCTGTCTCTGTGGA 
TTGGCCTGTTCTGGACAGTTCTTGTAAATGACATCATGTAATATGTGGTCTTTC 
GGTTTTTCCTTCTTTCACTCACTCATGTCTACCAACACCATGATGCCAGTCAGT 
GTTTTATTTCTTCTCTTGGCTGAACAATATTTCCTTCTGAGTCAGTTCATGCTTT 
ATCCATCCATTTGTCTGTTGATGGACATCTGGCTGTTTCAAAACATTTAAATG 
15 AAGATCACATGGCTAAGGATAGAGCCAGAGCCCAAGCCCCCCAAAAGAGAT 
GTATGAAACTCTTATGATCAGGGGAGAACTGCTTCTGGAGCAAGAGAACTGC 
CACCAGGATGGTTAGAGGATGTGGGTACGTGAAGTTAAGAAATGAAGGGGC 
ACACCTAGGGAGACGCCAATCCCTAGGGAACTGTCACCTTTAAGCTGAGGAA 
GGTAGGTTGGGCTTGCAGAAAGTACGAGGGCCAGGATGGGGGAGGGGCCAA 
20 GGTGGGAAAGGTGCTGAGTAGGATGAGATGCTGGCTTTATAGAAGTAGGGTA 
TGTACCAGCCTGAGAGGAAGGAAATATGGTGAGACCCCTGAAAGAGTCGTGA 
GTCTTTACACACACACACACACACACACACACACACACACACACACACAGAC 
ACACACACAGACACACGGTGATGTACCTAAGTTGCTAGATTGCCTAGAATGC 
ACCACATTGTCTAGCATGTATCATATATGAAGCCCTGGGTTCCATCTCCAGCA 
25 TCACATACAATGAGTGTGACGGTGCATGCCTATAATCTATGTGGAGATATAAG 
AAGAAGATAGACGTTCTGAAGTTCACAGTCATCCTTAGCTATGTAGTGAATTA 
AGGGCTAACCTAAGCTTCAGGAGACTCTCCCACCACCACCCCTAAATGGAGG 
AAAAGTACAAAAAACAACTTAGGGAAAGGCAGGGAGGAAACAGGGCAGCAG 
1GGTCTCCGTACAGTGAGGGTTTGCAGATCAAACCTCCTCAGGGCACCAGGA . 
30 TACAAACTGTTTCCATCTCTTCTGTATATGTGTGCATGCTTGCGTGTGCAAGG 
ATGAGCACAGCAGTGTGTATGGAGGTTGGAGGATATCTTGGATGTTGGTCCTT 
CCCTTCCAGCCTGTTTAAGATAGACGTTGTTGTTCATTATTGCAGTCCTGGGG 
CTAGCTGGCCTGGAAGTTTGAAGGAATTCTCTTCTTGGCATCTTGGGTTTCTAT 
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GGGTTCTGAGGATTTGAACTCAGGTCCTCACACTTGCACAGCAAGTGCTTTAC 
TTAACAATCTCCCCAGACCAACTGGATTGCTCCAAACAGCAGAGAAGTGACT 
AGACTTATCCAAGTGCTCTTAGCAGACACCCGGGAAATTTCGACTCACTGCCC 
ACTTCCCACAGCAAGGTTATCAAAAGGCTGGATTAGGATGATACAGCCTTGA 
5 CGAATCCTGGAGAGTCATGGAACCTGCAGGCAGCCCCACACTGGCAGTACAT 
GCfCCAAGGCAGTGCGAAAGCAGAGAGAAGCCTGGTTTTGGTAGCCCTCCCC 
CAGTCGGATATTTACAAAGAAAGTAACTCCAGGTTACTACCCACAGGATAGA 
CCACTTGAACCCTATAGGTCTTGCTTGGCTCCCAAGCACCTACCTAGTCCTTT 
GAGGGAAAAATTGGGCTTGCTACGGCCTTGAGAGCACACTTCTGGGACAATG 
10 GGGCAAAACTGAGACTTTGACAAGGTCAGTGCATTTCCCTTCCCCTTCAACCT 
TCCCTCCACCGGCCTCAGGGGAAGGATGGGACTGAGAGGTCTGCTGGAATCC 
GAAGAAGTGTTATTGAACAAAAAGTCCAGGAACCGAACAAAAACTAGTCTGG 
GTAGATTCACCATTAGGAAAAGAGACCCAGACTTGGAGAGCTGGCTGTAAGG 
TGCAGATGGGTGACAGTGTGAGAAAGGAGTGGACTTGATTGTATTCTCTCAG 
15 GTCAGGGCAACCGGTCCTGGGGATTCTTGAAAACACAGAGGCATGGGGTGTG 
CATGAGGGGCTAAATGGGTACAGGTGCCCAGGAACATGAAATTTCCCATTAC 
TGGTCTGAACCACTCCTCAAGCCCTTCTCACACTCAAGGGGCACAGGTGTGCT 
CCTGCAATGCGTGGACACCGGGTCCTGGTGGAGGAGCAGGGTGGGGCGGAGT 
GGGCGTGGAGCCTTGTGCGAGACACTCCCCCATCTCTGGAGCCACGCGCCAG 
20 GCGTACGCTTCCTTCTGGCCCCGGCATAGGACCGCGCCAATTGTCATTGGCCA 
AACGGGCGATCCCAGATTGGCTGAGACCCCGGCTCCCGCCTCCTCCGCCAGG 
GGGAGGGACGTGGGTGCGGGTTTGAGGCTCGTGCTACCGTAGGGGGTCGCGC 
TCTTCTGCCTCCTACCTCTTGGTCACCGCAAAGCTTGGTCCGGTTCTTCATCCG 
GCTGCAAGCGCTAGGTGTGCGGAGACCTGGCAGCTCTTGGGGCTTAAGGGCT 
25 GAGCACCAGGACGGGTGGAGGTGCCTGTAGAGTACATTCGGACCCTCTCTCG 
GACCCTCTCTCAGCCCCTGAGTGTGCGGGACCTGCGGAGCGCAGTTCGGGAT 
CTGCACTCGAGGATTTTTCGAGGACGCAATAAGCTAAGCATCTGCCCGGAGC 
ATGGAAGCACGTCAGTAGGCCATGAACTGCACCCGGGAGGGGTGGGGGTGG 
AAGCGCACGGTGTCAQCTTTGCAGAATGTGTACGCCAAGGGGAGGGTGAAGC 
30 GTGGCGGGAGGGCGAGGCGAAGGAAGGAGGGCGTGAGAAAGGAGGCGGTG 
GCGGGCGGAGGAGAGTTATCTATACTTTTTAAAAAAAAGGAGCCGCTTGAGC 
CGCGTAAAGGGAGACTTGGGGAGCGCCTGACAGCACGCGCGGGACACGAGA 
GTACCACGCTTCCCTACTCTTTCAGACCTTGACTGGTACGGGGTCCCAGGACT 
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GCAGGAGGCCAGCGACGCGTGCCCTAGGGAGTCCTGCAGOAGTGCCCTGCCT 
GAGGCCCGTGAAGGTGCAAACGTCCACTTCCCACCGCACCCGGTTCCTCGCG 
AGCACTTTTCCTGTGCCGCACCAGAACTCGTAGCAGGGGCCCAGGGGCTGAA 
TGCAAGCTTGATGGACGGCGGCGCGCTGCCCAGACTCATGCCGACCTCGTCTG 
5 GAGTCGCTGGAGCCTGCGCTGGTCGGCGGAGACAAGCGTCTCCGGAATTGCT 
GCGCTGCAGCCGGCGGCGGCGATCTGGAGCAACCGAGGCCAGCAGCAGCTCG 
GCTGCCGTGGCACGCCGCAATGAGCGCGAGCGCAACCGCGTAAAGCTGGTAA 
ACTTGGGCTTCCAGGCGCTGCGGCAGCACGTGCCGCACGGCGGCGCCAACAA 
GAAGCTGAGTAAGGTGGAGACGCTGCGCTCCGCGGTAGAGTACATTCGTGCG 
10 CTGCAGCGGCTGCTCGCAGAGCACGACGCGGTGCGTGCCGCGCTCGCTGGGG 
GGCTGTTAACACCCGCTACTCCGCCGTCCGATGAGTGCGCGCAGCCCTCTGCC 
TCCCCTGCCAGCGCGTCTCTGTCCTGCGCCTCTACGTCTCCGTCCCCGGACCG 
CCTGGGCTGCTCTGAGCCTACCTCCCCGCGCTCCGCCTACTCGTCGGAGGAAA 
GCAGCTGCGAGGGAGAGCTAAGCCCGATGGAGCAGGAGCTGCTTGACTTTTC 
15 CAGTTGGTTAGGGGGCTACTGAGCATCCCACCCCCCTAAGGTAAGTTCCAGG 
ACGGCGGGGAGGCGAAGCAGTAAGGGAGACACGTGGTGGGCGGGCCTGACA 
CTTAGCGCCACGGGGACCCTGTGCAGCCAGGACTCAGCTGGGGCGATCACTT 
GGATTTCGCGCACGCTTCATTTTTCCTCAACCTTITTCAAGCCTGAGCAAGAC 
CGGCGTTTGTTTGTCCGGGATTGCAAAACTTCCTCTCAGAGCTCTGCTGTGGG 
20 TGGGGGAAGGGGAGGCCAGGGGAGGGGAGCGGCCTCAGGGCCGGGCCGGCG 
AGGTCCCAGTGCTGTCAGAGCCTGGCCGAGTCGGGTGCTGGAGGCGGGGTGA 
GTTTGCATTGCAAATCGCGTCCTAGGCCGGGGTGCGGAGTGAGTCGGCTGGA 
GCGGGCCCCTGAGTCACGGCGGGCAGGTTCTGAGCGTGCGCCCCGCCCCCGT 
CGGCGCCTCTGAGCGGATCGAGGCACCCATGAGTTGAGACTCCAAAACTAAT 
25 CAAGCAAACGAAACTGCCTACTGCGCCTTGGGAGGTGGGGCGGTGTCCGTAC 
ACATTGACACCTTTTATCTTCTTCACAGCTGCATCCCTGGGTGACTCCTGGTGG 
ACCTACCTGCTTCTAGCCCAAGAAACCTGGGCCTATGCCTTACCCATGCTGTC 
TAGTGCAGCCTGACCAAATGCCAAGTGCTGACTGACCTCTGCTCGGCCTCCAC 
GCCGCGG AATGACATCTTCCATCTTCCGGAC'5-TTGCAGCATCAGGACTTGGAA 

30 ATTTCTCAGGATAAAGATTTTTACAATGACAATCTACTTnT 

GAACTGTTGTAGGACrCTACTGAAAATATGAAGAATTATTTTTATACAAAGGA 

TCCTTAAGCTTGGAGCACAATAAAGATGACCTCTGTCCCTCACCCCCACTGTC 

TAGAACTTTCCAACCTGGCCAAAGTGTGGACCGGTTGGGCCCTGAGGGCAAG 
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ATGCCTGGCTGCACCCTTCTTCCTCTTCTGAAGCCTATACTGACGCTGATGTTT 
GGCCAGTGTGGGAACCCTGCTGTTGCAAAGTGTACTATTCTATAAAAGTTGTT 
TTTCATTGGTGTTTGCAGTGCCTTGTACCTTCTCAACCTTGCATCTGTCTCACC 
CTTGTTCCCCAGGTGCGGTCTAAGCTGGGGTGGATAGTACTAGCCCAGATTTC 
5 TCTCGAGACCTGTATTGCATGGGTGGATACAAAGGTATTGGGTGCCTACTCTA 
GCCATCTCTCCAGAGATGGAAAAGCAATTCAGACACCTAGGAAGGCAAGGTA 
GCTGTCTGTGAGCCTGTGTCTGCCCTCCAGGGTGGCTCTCCCCTGGGAAGGAT 
CTGTGTAAGAGGGGCAAGTGCAGGGTGCTAGGCTGAAGGCAGAGCTCTGGAT 
TGTGGGCATCTCTAGGCCCAGGTCTCTCTTCCTTACCACTTGGGTGGCCCTTG 

10 GGAAATTGCCTTATTTGCAAATGGTATTGAGCTTAAAGCTCCCTCCCAGGTTT 
TGGATAACCTGCCCATGAGCTTGCAGCCAGTGTGTGCTAGGCACCCGTGCTGA 
CTGCTAGGGTGAGTGGCCATCGTGAGGCCCTGCAGATAGATCTTTACTTTAAG 
ATTCTCTAGTGAGCAAGGAAGACCTGGAAGCTTCCTCAGTACTCCCCACACA 
AGTTCCCTAGCCTAGGGAGCCAAGCTGTGATTGCTAAGATATTACCTGGCTCC 

15 ACCTTTGACCCCCGAGCCTCCGAAGCTTTGAAGTCTCCGTGTGTCCAAAGCTC 
CCTrTTATTGCAGGGGTTTGGGAGGGGCTGAGGGGATCCCCAGGTGGTTTTAG 
GGTGCTTCAGGCATCCCTTCAGAAGGGAGTGGTCAGGGCACAACCGTTGGAG 
CTATGGGAATCAGGAAGTGCTGTGCAATGGAGCAGATGCCTTCCAGGTACCT 
GTGGTGGCTATACAGAAGGCAGTATACAAGAAGCTCAATCTGTATTATGATA 

. 20 GCTGGGCTCCTTCCCTGCAGGACCCAGAACCCCAAAGCCAGGGTCAAAGTTG 
AATCTGTAACTTTGGCCCCAGCTTGTCAGCTTCTCTGATGAAGGTCAGGCTCA 
GTTGGGGTCAGGCCAAGCACAGCTTGGAGCCACAAAACTGAGTGGACCTGCC 
TTAGAGGACAAAATGGGGCATGGCAAGGCCATTGAGGGAGGGCCACCCTCTT 
CCCAGGAGCCCGTTGCCAGCCAGTGTGTTCACTGAACTAGGGGACCTGAATT 

25 GTCCCCTGATTTCTTTCCTGTTTCTTAAAGTGGCTTATGGGGAATGGAGGGGT 
TGCTGGAAGCGTGAGCTCCCTTGTGTCCATAAATTCCCAGTGGTGTCAATGGT 
GTGCCCCTGTTGTATTATGGGATATCTCTGAAGTCTTCATTAAACAACTTTATT 
GGCTCACATCTGTAATCTGAACACTTAGGAGACTGAGATAGGAGGACAGTCC 
AAGCTCAAAGGGGTAGGGATGAGGATGGGGGCTAGGAAGGAGGAAi.GGCAG 

30 ATGGCTCAGTAGGTGACAGTGCTTGCTGTGAGAGTATAAGGGCCCGCATTGA 
ACTCTTCAAAACCCACAGAGAAATTGGATATAAAGCACATCTGTAATCCCGG 
TGCTCCTATGGAAAGAAGGGAGGCAGAGACAGGGGAATCTCAGGAGGTATG 
CAAGCCAGCTAGGCTGGCACACACAACGGAAAAACAAGAGCCTCCTATGTCA 
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AACAAGGTGGAAGGTGAGGAACAACACTTGATGTTGTCCTCCGATCTTCATG 

CATGAGTAGGTGCATACACAACCCAGGCACAAAGGAACCCTTCTGCTTCAGG 

AACTGAGGGTGTACTGCtGTGTCTGACAGAAACTCCAACCTTAGGGAACTTA 

CTTTCTGCTGGAGATGAGCCTATTGGCAAGAGGCATGTACCAGCTCATGGGTA 

ATCAAGTGAAAGAAACTGCTTAAGCCCATAGCCGGGTTGTGGCTTAAGGAGG 

CTTCTGCAACTCCACCCTGCTGTCTTCCTCCACGTGGTTTTGAAGCATCTCCAG 

ACTGCACAATTTCCTTGCCTAAGGAAACCCTTTATCACCCATGGTGATGAAAG 

TGCAGCTTGCCAGCCAGTGGCACCGTTTCTTTTCTGATGTATAGAACCAGGGT 

CCATCCCATGTCTAAACACCAAACTTGCAAGGTCTCITAAGCCTCTTGGAAGC 

AATTTGTCCCAGGGATCACGTGCCACACGAAGCATAGACACCACACCTGGGA 

GACCAATCTCGGGTGGATGCCAGCTTCCAGACAGATAATGTGTCCTTTGTATA 

CCAGTCTAGGGGCTGGCCAGGCTACAGCAGAAGTCTGGAACTGTATCCATCT 

GTCTATATAGTGCTGACCTCGTGGGGTAGTCACGTGTCCTGTGATACCCCAGG 

GTTCAGCTGAGCTTCTAGGTTAGTCTGTCAATCAAGGCTATGTTACTCAAGGT 

GTCTTTCATTCCCGGGCAGCACCATTGAAGACACAATACTCCCCAGATACTGT 

CTTCCCAAAGGGGCCCAGGCAGTCCTGGGAGCCGAGCTATCAATCAGCCTTT 

AGAGTCCCAGAACCCTGAGCTCTACAGGGACATAGAAACAATCCTCCTGCTG 

CAATATGCAGGCTGTCCAGGGCAGCTTGAGAAATGTCTGAGGCACAGAGCAA 

AGCAAGTGGAGTTTCAGCAGGTTCCTGTCCGTCATCTTTCTTTGGTTTTGAGC 

AGCCTTCTCCTGCCATGTGTCCCCATCATGCTGTGTGGCCCGAAACGTCAGAG 

CTCACAGACTGATTATTTCAAGTCTTTATATTGTAGTGACAGGAAGCCGGCTG 

AGGCCCACGTCTTACTCTAAAAGGGCAAAGGAGTAGCGTGAGAGGATTTAAA. 

ATGCAAAGCTTCTCTTTCTTTGGTGCTTTCTGTCCTGCACTCACTAATTTAATG 

CCATTGTCAGGAAAGGAGCCATTTAATGGCAGTTGGTTCTCTATATCCAAGGG 

CTCTCCATCCATAGTTTGAAAATGTTTGTGTGTGCATGCATGCATGTATAAGT 

ATAGTACATGTGCATGTGTGAATGTGTGGAGGGCAGAGGCTGACGTTGGGCG 

TCTTCAATTATTCTGCACCTTAGCTTTTGAGACAGGGTCTCACTGGGCCTGAA 

GCTCATTTATAGGGTATATTGGCTGCCTGGAAAGCCTGGGTTTTTATATGGGT 

GCAGAGfiTCAAACTCATGTGTTCATGTTTTTGTAGTACTTAAATGTTTGAGCT 

ATCTTTCCAGCCCATTGAAAATATTTTGAGATGTTTGGGTATGGTGGTGCAAA 

CCTTTAATTTCAGCACTCAGAAGGCAGTGGCAGGCGAATCTCTGATTTTCAGA 

TCAGCCTGGTTTATAGCTCTAGTTCCAGGACAACCAGAGCTACAAAGAAAAC 

AAAATGACCTCTAAAAAACAAAACAAAACAAAATGTTTTGAGAAACAATTTT 
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ACTCATCGTAAACATGTGGGGACCTTTCCCTTGTCACATACAGTGTTCTGTTG 
TCGTGGTGCAGGGTCAGTCCCTGCGGGAGGATTAAAGCACAGGAGGGTACAT 
GTAGGCTCTGTAACATACTACAGACTTCGACAGAAGAGTTTTGAGAAGTCCTT 
TTTGGTATCTCAAAAAGGTGGTAGTTTCCCAAATCGGTCACeCTCAAGTATAA 
5 AAGACCAACCAGAATAAGTGGCATGCATGTTGTTGCCATCTGTG i i 1 1 1 1 GTT 
GCATAATGCCAGTTAGACGTGCAAATGCCAGAACATTTAGCTAGCATACAGA 
AACTAAGACAGGACATAGTTCCCAGTACCCCAGAGGTACCTCATGGCCAAAG 
TCCAGTAGTTTTCATAGTGTGCCAGTACTGTCCATCCTTGCTTACTGGTGGGA 
AAGAGAAGGACAAGACAGGCTTGGTGGCACATGCTTTTAAACCCAGCACTGA 
10 GGGTGGAGTGGGGGGGGCAGGGCTCTATGAATTTTAAGGTCATCCTGATCTA 
CATAGTAAGTTCCAGGTCAACCAAGGTGTCACAGTAAGACTGTGTCTCAAAC 
AACAGCCACGACAAAGAGAGACTTGGGATGACCTTGCTCTCCTGTCAATATC 
TTCAGCATAAGAAGTTGTCCGTTTGATGTAGGGAAGTCTCTTTCCGAGAGATA 
GATGTAGTCCCTTGTCCOTAACTCTTCATGCTTACTTCATCTCTTTCCTGCCTT 
15 TGAAGCCAAGTTCTGGTTTGAATGGGAAATGTGGTCCTTGGACACTGAGCTTG 
CTCAGCTCTGGGTTTAACTAGGAGACTTTATGGTGTGGGTGGCAAGACTGGAC 
ACAAACAGAGCAACAATGTCTCCTTATTGAACAAGTATACTCCAGAGTCTCCT 
GACCTTCACCCTCCCACTAGGGGTCAAAGGTAAGAGTAGCAGGAATTTCAAG 
ATAGGGATACTGTGGGGCCCATGGGGCTGCAGAGGTCTGATGCCTAAGAAGC 
20 TGGTCCTCAAATCACCCACAGAACTGCCGTGTGAAGAGACCCCACATATCAC 
TTGCTTCCTGCCCCTCTTGAGACCACAGCCCCTTGTAGACAGTGACACCCAGG 
CATGTCCAGCAGCAGAAGGGATGTGACATGGATCACTTTTGTCAATGGGGCC 
CTGTTACTCTTACACACCCTCCCTTCTCATAAGATTGGTGACCTTCAGAGACG 
TGGGTACCAAAGCAGCACGTCTGTGTCCAGAAGTAACTGGCCTGATGACAGG 
25 CTGGGCAAACCTAGTGTGGAGTGAGGCTGAAGGGCCCTGATCTAGGGCTGTC 
AAGAGCGAAGGTAGGGGCTGTAGGACAGGGAGGGCCTTGGAGGCTTCACAG 
CACAGGTAGGCCCTTGGGTAGGTGCATGTTGCCGCCTCTGCTCTCCTGAAAGA 
GGGGTCTGGCCCGGTGAGTGGCTTCTCAGATTCAGTGTGAGTGGTTGGTTTGG 
CTTGGCrCTGAGTTCTGAGGGCiTTGGGGAATGGCTITCTGTTCTGGAGCCTAG 
30 AGGAGATGAGAGAGAGGAAGAAAGGATCCAGAAGCTGATTGTATGGGGTCT 
AGGCTGTAAGGGCAGGTGACTAGGGGAGGGTGGAGGATGCTGCCAGGGAGA 
AGACACAGCACAAAGACACAGAAAGTTCCAACCTCCCCGGGGGGCACATCCT 
CCTGGACTCTTTGAAGGGTTTTACCCTATCTAACAGCCATAGGTACCCAGAGA 
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AAGACTCCATGTGAACAGGCCACACACCCCTTATGCCTGCCTGAGGCAGGGA 

CTGGATGTTTGCTGTTTTCTGCTTACGTAGAGGGACTGGTCTGCCAGTTAGAG 

AGGAGCCAGGACCCAGCTCAGACCTCTAGGACACTCTAACCCCTTCCAAGCA 

TGTGAGGAAAAGGCCCATGCTCTGAGCCTGGAGACCAGAGCCAGGTGCTCGG 

GGCTCAGGAAGGACTCCCTAAAAGGCCGACTGGAAGAGGTAACTCTGCCTTG 

AGCACCAGGCCCTGCTTCACCTCAGATCTTCTCCTTGGTGAGCACAAGGAAAG 

GATGAAGAGGAAAGAGATGGACTGAAAGTAGGTATGGTCCACTCAGGGCTG 

AT 1 'CC '1' G TT CCGGG TCGCCA"GAA£CTA^F6€GTTGGTGGCGGTATTGGTCCGGC 

CTTATACCTCCCTGGCTCCAAGTTGGAGGTGATTGGCCAGAGGCCTGAAGGC 

AGAGAATCCTGCCTCCTACCACCAAATGCTTAGCAGGACCCCTGCTGAGATTA 

TGAGCTTCTGCTCCCTTCCCCCACCCnTTCCAGTTTACATATTGGAGTAGTCCA 

CAGAGTTCTGAGCTGCCAGATGCGTAGGTGGCTCCCAGATGTGCAGTCCCAG 

CTACCGGAGGGCAGGGACCACCTGACACTCCCCCAGAGTGACTCTGCCCAAG 

GCCACTCAGCTATAGGAAGCAGGGCAGGGTATGCTTGCCAGCCCCCGTTTCTC 

TACCCCCTGTCCCACCCCTCCCGGGCAGCTACACTCTGTTGTAGCCTATCAGA 

GAGGGTTGTGGACATCCAGCTGGGTGCTGCTTGCCCATACCACCAGGACCAG 

GAAAAAGCTGGCCCATTTACTAATGCCGGAGCGGGGGCACATTTGCGGAGCA 

GCAGAAGGACCTAGGAGCATGACAGCTGCAGAGGCCGAGGTGAGTCAGCTG. 

GACCCGGGGTCCAGGGACAGTACCTTAGCATCTCCTAAGGGATATTTCTCTGC 

GTCTAGAAGTCCAAGGTAGAGGTGTTACAAAGTGTTACATGGCAGATGGTGT 

CTTCTTTTGTGTCCTCATATGATCATrGCCTGTGTGCCTGTGTTCrGGTCCTTTT 

TACAAGGAACCCAAGTCATATGGAGCAAGGCCACCCTAGTGACCTCCTTTTA 

AAAGACCAGTACTCTGGATGATCCGGTGGTTGTGGCTTCAGCTGACAAGTGT 

GTGTGTATAGGCAACAGTGTAGCCCCCATCAGACAGGAAGGGCGGGTCCTTC 

TACATAGCTGTATCTGAGGCCACACCTCTCCTTCACTGCTATCCTACCCTGGA 

GAAGCAGGTACTTCTTTGGTCAAGTCTGGTGGATGGTAACTAGAACTGAGGA 

CAGAGATATCTGCGTTTCTGTGCAGCCATGGGAGAGATATCTCCCCTGAGACT 

AGGTTGTCTATGACCAGACCAGAGAGTAAAAGCTACTTTCTAATATCCCAGCC 

TGGCAAGATGGAGCTTAGGAGAGTAGGTTGGACTCCC7CCTCCCCCAGGGGG 

TTGCATAGCTAAGAAGGATCCTTTCTAGCTGTGACATCATGTGACAGTGGCTC 

CAGACAGGTGACAGTACCCCTGGCTCCAGCTATGGGGTGTTGGGAACCATTC 

AAACAGGGATGTTGACCTCCCCCACCTGTCGGTAGATTGCAGTTTTCCTTGTA 

GCTTGTCCCTTGGGCCCGTTCCTCTAAGTGTTGTGGCTCACTGTGCTGATTGTG 
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ctgtgatgctatctgggaggtgggggcagccctaggctcttAatgaccctgtc 
ttccttgcacctggtgcttgcctgggagccccaggcttagtccagtcccccag 
aggttggctgcattgataaacatgcaaatttccaaaatgaatgcaattaata 
cccaaatgaaagccggggagtggtagaggctggactggatcgtatcttggag 
5 catcaaatatttgtctcttatttgaatgcattttgtaggctggagggtggggc 
ctttaactcagacaacagctgttttataagctaggtttggggagatctgtgct 

GGATTTTATATATCCAACTACACCTATGGCCTAGCTCCTCCCTAACTACATCC 

A^GTGCCCATAGGCTGGCTATTCTATAAAO^A1CX:CTCCTGCATGGTGGG 

CACAGAAAGTAGAGACCTACTTGTCAGACAGTAGGACTGAGGCCTGAGGGTA 

10 GACAGCTTTGCGTAAAAATGTAGTTGGAGGGTCAGGTGAGCATAGTGTCTGG 
GGTTCTTATCCCAGTCTGAAGCCAAGACTTCAAGGACGTGATTATGGCAGCCA 
GGCAGGCAGACAGTTTTCTCAGGACTGACATGGACGCCTGGGTGCACAACAC 
ATGTGTCTGATGGCTCTCCAGTCCTTTTTCCTATTTTGAGCCCACTCTGTGTAT 
AGCTAGTCTGACTTTCCCATCGGGGGTGACAGCCTTGACCACCCCTGGGACTT 

15 GGTTGGGATCCTGGTTACTTCTCTAGTCAATATTTCTCTAAACATGCTTAGAA 
CTAACAGGGTGCTTAGAGGCCAGTTTTCAGAGCAGAGAGCTTCTTCCTCAGTT 
TACCCTGCTGGGGTACCAGGCTTGAGGTGTTTCCrGCITACTCTTTGCCTGTGC 
CCATTTTACAGGTGTGGAGGCTGAGCCTTGGTTGTCTTCCCCCGGACCACTCT 
GGTTGTCACCTTCITCCTCTTGAATACCAATGTGATTTCTATAGCTGGCCTGTA 

20 GCTGGTAGAACTGAGTTCCTAGGATTGGGCAAGAGCAGTGGTTCTCAACCTTC 
CTAATGCTGC AAC ACTTTAATAC AGTTCCTCATGTTGTGGTG ATCTCC AACCA 
CAAAATTATTTTTATTGCTACTTCATAACTATAATTTTGCTACTGCTATGAATC 
ATAATGTAAATATCTATTTTCTGATGGTCTTAGGCAAGCCCTTTGAAAGGGTC 
ATATGACTTCCAAAGGGGTCTCAAACCACAGGTTGAGAACCACTGCTGTAGA 

25 GGCTAGGAAAGGCTGTAGGTTCCTTACAGTCTAGGGTTAAGGAGTGGATGAG 
TGCTGGGGTTGTCTTGGCATTTTCCTGACAGTCATCCCTCCTTCACTCTGTATC 
TCCAGCACATTCCCAGTCATCCCTCCTTCACTCTGTATCTCCAGCACATTCCCA 
GTCATCTCTCCTTCATTCTGGATCTCCAGCGTGTCCCCAGTTTCACGCTGGCAG 
- AGCCATCTTATAATGATGCAACTTTACATTCACTTTGCTGAGGGCTCCTCCTGC 

30 CAGCCTTCCAAGAGATGCCCTTTCTCTCCATCCCAGCCCCTCAGATGTACTCTT 
AGAGAGCAGTTTAGAAGAGAGGCTAAGTTTGGGTGTGGGCACGCCTTTCCAT 
CCATCCAAAAGGCAGGGTCTGGTAGGGCCCTGCTCACTCCTCACCTACCCATC 
CTTCTATTCCTCTTTCTCTAATGTGCTTACCCTGCCTTCTATCTGGGGATACCA 
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GGTGCTGAGGGAGCAGCAGATTGAGCTTTGGTCACCTGCGCAGCTGGGCAGG 
AGGAACCCTAGAGGCCATGGTCAGCTTGGGCCTGTGGATGAAGGAAGGTGTA 
GTAGAAGGAGAGCAAGTGGAATGGAGAGGTGCAAGTTTCAGACCTGTGCCA 
ATGTCCTGTCCCAGCCATCTGTATCCACCCTGTTGTTGAGGTCAGGGTTACTG 
5 GAACCCTGTTGGCCTCCCTAAAGCTATTACATTCTGACTGTCTCTCTGGTGTA 
AAAGAAGAGGGGACACTAGAGGATTCCCTAGGGGTTATGAAGATTGGAGATT 
GGGCTGGACTGAGGCATGATCCTTGGGCTTAATCTAGGGGGCGCCCAAGACA 

CATGACAATCeGTTTGGGCTCCCAAAeTeTGAAGGAAGGGGGTCTTTGGGGAC 

AGTGTTACAACCCCCACCCCCACCCCATGAGTGTGCCCTCAGTCAATTGTCTC 

1 0 AGGG AC ACAAGCCTGCC ATTACCATTACCCCTCTCCCAAGATACTCTGGATCT 
ATAAGTCTCGGGGAGGTCAGGAAATGGCTGACATGGTTCATTTGAGGTATCC 
TTCTGAGCAGGAGTCTGATGTGCACACTGATGAAATCTTCATTCATCTATGTT 
TGGACAATCCCTCAGTCACCACACCAAATGTTTCCAGAGCCACAGGCTTCCCT 
GAGACTTGGCCCCAATCCTGCCATGTTCTGATATCTGTTACCACACCTAGGGG 

1 5 CTGTTCTTGAATTTCACATGAACAAAGCACACACATGTTCCTTCAAGCCATTA 
ACTGCTGTGGTCTCTAAAGATGACCACTGGTGACCTCATAAAGTTCTACCCTC 
TTCTTTGCTTCTGACAATGGTCCTTGGTTCAACTGTTATAGACGATGCTATAAA 
CAGGCCTTTCTTGGGCTAGTGCAAATCGACCCAGTCTTCCTGACGTTTTCTAT 
GTTGGTACCCAAAAGGGATAAAGCTAGAGGGATATCTTGAGTTTCAATCAAA 

20 GATGGCCACTGAGAGCCCATGAGCAGAGCTGTAGGTAGTTACTGGGCTTGAC. 
TTCCATCTGAGATATTAGGGCATTGGGGTGATGGCCAGAAAGTATGAGTTGA 
GAGACACTGGTCAATGGATGGGCTCCAT^nTTGGTTCTTATCACCCATCCTTGG 
CTGCTGGGAAGTCCTGTGGTGGGAATTGGCCTATGCTGTTGGCTTCTCATTGC 
AGCCTGGGAACAGGCATCTGGTTGAGATGCCTCATTCCTAATGGTCAAAGAG 

25 ATCTTTCCTGGTTGATTAACCTCTGAGGCAGATTCACTTGCATAGTGGTACTA 
GTGCTTAGGAGGCTGTGGCACAAAGATGGGGAGTTCAAGGCTAGTTTGGGAA 
ACCAACTAGCCTTCTTTCAACCAACCAACCAACCAACCAACCAACCAACCAA 
CCGACCGACCAACCAACATCAGACAGACAAATCAATCACCTTAACAGATCTT 
CTACTTTTAAAAGAAGAGGCCAGGTCATTTATTCTCTAGAGCGTTGATGGGTC 

30 GAGTAGGTTGGGACACACAGACTAGTAGGGCACCTCCTCCGTACCATATTGG 
CTCTGGACTGAGATATTGGGGATCACTGGCCTTTCTAAAATTTCTATGAACCC 
TTGTGAGTTGTTAATGACCTAGTATCTAGGTTCCAGAGCTCAGCCAAGAACAA 
CCTGGAGACAGGGCACATCTGGAGTGGACATAGGGTACCTGTAGGTGGGAAA 
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" TAGAACTTGAGTGGCTGGTGATGGTAGGATAGGCTAGTTGACCTGCCGCGGG 
CACAGTCAATTGCCCCAGAGGAAACTTGGAATTGTGCAGAGCTGACAGGCAG 
AGTCAGGGAGGTCCGQGGGTTCAGCTGGTTTGGGGACTTCAGGGTCTGGTGA 
ACTTAGGATCAGCAAGACAGACAGAGAGATGACAGGCAGTAGGCAACAGAC 
5 CTGGACACAGCCCAGCACCACCTCTAAGGCCTGGCCTCCCCTTCCCCTGTGGT 
GGCAGCCACCCCACCCCCAAGCAGCACCAGGTTGGTTCAGGTTGAACCTGTT 
CTGGGCAGGGCCACCACTGGGGGTGGATTCCAGCTTCTGTTTCTGCTCTGCTT 
CATCTGAAGCTGGAAACGCCTGGCCCAGCCCTGGCCACTCAAGCCCAGCCCT 
CAGCCCCAGGGCTCTGCTGACAAGACACCAGGCAGCCAAGTGCTGCCCCTCC 
10 CTTCTTCATGAGCAGGGGAATGGCACAGGCACAAAGACCCCGTGGTGGTGCA 
CCATGCACACATACACAGGCAGAGACATTACCAACACACACATATTCACATA 
CCCAGATACACGCAGAGATATCCTGAAGCAAGCATATGTACATGTACACACC 
CAGATACACCCAGACACTTTAAAAACACACACCCAGAGACATGCCAGACATC 
ACAAAACACATATGCTCAGATATATACAGAGACACTCCAAAACACATATAAA 
15 TACACACACCCAGATACATGCAGACACCCCAAACACACACCTACATGCACAC 
ACCCAGATACATACTAGGCACTCCAAAATACTCCAGATACATGGGAGACTAG 
TTTGGGCAACCAGCTAGCCTTCTTTTAACCAACCAACAACCAACCAACCAACC 
AACCAACTAACATCAGACAGATAAACCATCCATTGTAACAGATCTTCCACTTT 
TAAAAGAGGTAGGTCAACTGGACTCCCCTGTCAGCAGGTGAAACACACACAC 
20 ACAC ACACACACACACACACACAC ACACACACCACACACACACCACAC ACAC 
ACACACACACACACACACACACACACTACACACACACACACACACACACACA 
CACACACACAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGA 
GAGAGAGAGAGAACATGCACACATCCAGTGTGGTGGTTTGAATAAGAATGGT 
CCCCATAGGCTCATCTGTTTGAATGCTTGGTCACCTGTTGATGGAACTGTTTG 
25 GGAAGGATGAAGAGGAGGTGTGCCACTGGAGGTGGGCTTTGAAGTCCCAAA 
ACCCCACACCATTCCTACTTAGCTCTTTCTGCCTTGTTCTTTTAAAAAAATTAA 
TTTATTTTTATTTTATGTACATTGGTATTTTGCCTGCAGGCGTGAGGGAGTCAT 
ATCCCCTGGAACAGGAGTGTAGACAGTTGTGAGCTGCCATGTGGTTGCTGGG 
AATTGAACTCAGGTTCTCTAGAAGATCAQCCAGTGCTTTTAACAGCTGGGCCA 
30 TTTCTCCAGCCTGTCTGCCTTGTTCTTATAGAtTAATATGTGAGCTCTCATTTC 
AATGCTATTGCCTGTCTATTACCATACTTCTCACATGATGGTCATGGATTCTAA 
• CTTTATGATACCCTAAGCCATAAATTAAACACTACCTTTTATAAGTTGACTTG 
GTCATGGTGTTTTATCACAGCAGTAGAAAAGTTACTTAGACACCCAGATAAA 
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ATCAGATACCCCAACACATACACACGCAGACACACACACACACACACACACA 

CACACACACACACACACACACAGAATTATGTAAAGACATCCTGACACACAGA 

CACACTCTAATACATACACACAGACACACACTCATACACACACACACAGACA 

CACACATACACACACTCTCTCTCTCTGAGAGACAAATGGATACATAACACGT 

ACACAGACACATTGAGACAGATACATACACAATTACAAGACGAGTTGAAACA 

CACACCCATAGATACACAGCATACACATGCACAGACTAAACATGTGTGTGTA 

GGTAGACACACAAGCACACACACCTGCACAGACATGAACTGTCGTGAAGGTT 

GGACTGTAGCCTTGGACAGTATAGATACAGGCATGGGTGGCTCCATGCTGAT 

TACAAAGTCCCCCATGTTTGTTCCCTACTTCTCATGGTTTTCTTTACCCTGTTC 

ATTTACAAGTGCCTACTCAGACATCCCTTACTCCTGACACCCCACGAAGCAGA 

GCCAACTCCATCACACAAATCGACCTACACCTTTTATGGCATCCTGCATGGTG 

GCACCGACACCCCCAAACCCACCCCTTACCATCTCCTCTCCATCTATCCATAG 

TGTTGGATGAAAAGGCAGGTTTTTAGATCAATGGAGCAGAGCAGATCTAAGG 

GGTAGACCCTGTAAACATGGCCAGCTGAGCTTGGGCAAGATGTGAAAGAAAC 

TGGATGGGAAAGAATGGCATATGTCTAGAGCTTTTGGTGCTAGCTGCCATCAT 

TTCTTCTAATTTCAGCCTCAACATAAGTCTACAGGCAGCTGAGGAGCAAGGCA 

CTGAGCTCACGTCTTTAACATCTGATGAAATAGAATGCCATCGCAGCCATTAC 

TTTGCCGTACATATCTCTTCTTTGTTGGGATGTTTGTTTAAGCCCITTGC 

TGTTGTTGTTATTATTGTGATTATGATTATTATTGTGGGTGTGCATGATGTGTG 

TGGTGTGCATGCTTGTGTGTGGTGGTCAGAGTATAACTCTGTAGAGTCGATTT 

CCTCTTACCACCTTTACATGGGTTCCAGGGATCAAACTCAGGTGACTAAGCCT 

GTGCAAAGCTACCirrACTGGCTGAGCCATCTCAATGGTTGCrTTGCAGGTGT 

gtgtgtgtgtgtgtgtgtgtgtgtgtgcgtgcacgtgcaAgcacatgagcact 

GCCAGAGGATCATCTCAGGCATCITTTCACAGATATTATCCAACrTrAA^ 

TATTTTCTGGGCTGGGGAGATGTACATTCAGGTACAGAATTTAGCATCATAGT 

ACATAGCAACACATTTCTGCAGCCCCATGATGGTGGCTTTCTGTTGCTGTTGT 

TAGAGTTCTTCTGTATGCCAGCTGAAGCCCTTGGCCCAGCTGTGTGCTTTACG 

AACACTTTTTGCCAGACTTCATTCTTCTTAGCATTGCCGTCTACATTTTAAACG 

TAACGAAATCCAGCATGTAAGTTTTCTGTTTTATGGATTGTGTACAGGTATTC 

TGGTGGGAAGCCTTTGCTATCTTAGTTGTTTGCAGCTCTGGGCTGCTGCTTCTC 

CAGTAGATGTTTGACAGTGTTTGGAGAACTTTGTGGTTCTCACGTTTATATAT 

CCTCGTGACACAGAGCAAAACAACCCCATAGCAGAGAGATATGAATGAGTCT 

CTAAAAGGCAAGGGAGAGGATGGGAACTTATAACTACCCTAAAGCTAGTTAT 
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AAGTTATAGTTTTTTTCTGATGCAAACCTTTGAGTCACCCATGCACTCATAGG 
TAATCCTTCATCCATCTTCACAGCACCCCTGGCTGCCATTCCCGTAAGTCATCC 
AAGCTGGTCAGTTTTATmACTTTCTACTTGACACAAGTTAGGGTCATCTGG 
GGAGAGAGAATCACCACTGAGAAAACTTCCCAGCAGATTGGCCTGTAGGCAA 
5 GTTTGTGGTGCAGTTTCTTGGTTAATGGTTATGTGGGAGAATCCCATCCATGT 
GGATGGTGGAACCGCTGTGCAGGCAGTTCTGGGTTCTATAAGAAAGTAGGCT 
GGACCAACCATGAGGAGCAAGCCAGTGAACAGCATCCTTTATGATTCTTGCC 

^gA^GTTGGTGCCTIGAGTICCIGCCCTGAT TTCTCTCCAAG 

ACTGTAACATGATACAAACCCTTTCCTTCCCAAGCTGCTTTTGGTCATGGTGTT 
10 TATCAAGTGACCGAAAGCAAACTAAGAAAACAAATTCACTGGCTTGCTAGAC : 
TAGACTTGGTGGTCTGTTGTTGGTGCCCTATCTGGGGCAAATGGACATCTGTT 
TGAGCCTCGCAGGAAAGTCTCACAGCAGCATTCTTTCCCATCATCTCTCTTTG 
GCATCTTGCCCAGACTCACCTGATCACATTTACAGGGATCCACTTCTGGGATC 
CAGTGGAATAATCGTAATGTGGTGACTATGAAATCCTTAGTGAGTGCTGGGCT 
15 GGGTCCTTCGCTTTCTGAGTTTTCTTTGCTATCAACAGCTCATGGTTCATACAC 
ACAGAAAATCATGTTGGGATTTTTAGAATCAATTTGGGAATATTTTGATCATC 
TTGAGAACGACTGATATCTTAAAACTGAATCACACCCATGGTTGTTGTCTGTT 
TTATTAAGACCGTTATTTAATTTCTATCATCAATGTTATATTTTTTGTAGTATTT 
TTTCCTTTTTCCACTTGCATGTGTGTGAGAGTGTTTGCATGTATGCATACATGT 
20 ATTTGTGCATGTTTGCATATATGTGCATTTGTGTGTGCAAATGCATGTGCGGG 
TGTACATGGGTATGTTTGTGTGTGCAGGTGTGTTTGTATATGTGTATATCTGTG 
TGAGCATATACATGTATGTGTGCATTTGGAGGTCAGAGGCTTCTGTGGAGACT 
CATCCATCATTGCTTTTTCACTTTTTTCATCAAGACTTTCAATCAAGCCCAGAG 
CTCACTGATACAGCTAGTCTTACTCTGGAGATGTGCTGCCTCTGCCTTCCAAA 
25 TATGGAGATATTGGTGGGCTACCATGTAGGTCCTGAGGACCCTCACACTTGAG 
TGGCAAGTGCTTTAACCACCAAGCTATTTCCTCAGCCCCAGTGTTTTGTAATTT 
TAATCTTTATAGGTGATACATGTTTTGTTATAGCTATTTTGCTGCTCTAAGTTG 
AGCTTTTGGAAAAATGAGTTCTAATAGCTCATTACAGATATTTGAGAACATGC 
TTCA^rTTCTCGACATTAATACACACTTAGTTTCCTGGTGAGCTTTTGTTTGTTT 
30 CTGGTGGTGGGGTACTACTTAACATCCATAAfATCCATGCCACTGGGAAtAGA 
GATAATTTTATTTCTTCCCTTAAAATATATATGTTTTAAAAATGTTATACTGGT 
TGAGCCTTCCAGTGAAAAATGTAAAGTCAGAGAGGCAAGAGAGAAACCCCAT 
GCCTTCTCTAATCATAAGGAGAGCACCACTGTGGTCCTTCACCATTGAACATG 
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CTAGCTTCCTTTTGAGATCCTAGTTATTATTTTCAGTGCTAGGGGCTTAATCTA 

AGACTCTTGCACATACTAAGCAAGTATACTAGCCGTCACTGACTGTAACCTCA 

GTTCTTTTTCCACTTGAAAAAATAATGTAGTGTGTGTGTGTGTGTGTGTGTGTG 

TGTGTGTGTGTGTGTGTGTGTGTGAAGTTCTCATAGTGACCAGAAGATGGTGT 

CAAATCCCTTGAACTGGAGTGACAAGACAGTTCTGAGCTTCCATGTGGGTGCT 

GGGAATTGAACCCAGGTCCTCTGCAAGAGCAGTAAGTGTTCTTGACCACTGA 

GATATCTTCTTGACCCCATGGATTTTTTTTTTATTGT^^ 

TTGGCTTATTAAAGTCAATCTTTCTATTTTGTTTTGTTTTGAGATTTTGAACCT 

AGGATCTTGTGCATAGTGGGCAAGTACTCTAGTACAGAACTACATCCCTAACT 

CTGGTTTATTTATACCCCCTTGATAAAAAATTTGCCTTGGATTTTATTTTATAT 

CATAACATATTATGCAATATCACACATACACATGTGCAAACACAAACTTAGTT 

ATTTAATCAATGGCTACCAGATAATAATATCACACTGTTTTATGTATCATGGT 

AGGAATATTTTTAAAGATTGGTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCT 

TCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCCTCTTCCTCTTCCTCTTCTCTC 

TCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGTGCGTGTGTGTGTGTATGTGT 

GTGTGTGTGTGTGCCACCTGTGTGTAGGTGATTGCAGAATCCAATAGATGGTA 

TTAAATCCCCTGGAACTGGAGTGGCAGATGTTTGTCTGCACTCTTCTTCTAGG 

CTTCTATTCTGAGATCTCAATTAAATATGCGTTAGAGTTAAATATCAAAGCTC 

TGTCAGTGCAGGAGGATGGGGGTGGGGCGCCGGTCTGTTTTTATGGCTATCCC 

TGTATTTCCATGTCCTGTGGACCCAGCCTCCTCAGAGTTCCTCCTCCCCCTCCT 

GCTGTGGTTCCGGGGTGCACAGCTCACTCCTCACCACAGGTTTGTGGGCTTCT 

AGAATTCCCCGGATGGGCTGTTCTTCGATGGCACCCTAAGCTGGATGGTTTTC 

TTTCTCCCCCTCAGGGAGAGCCTTCTCTCTGCAGTAGCAAGCAGCTTTCCCCC 

CAGGAGCAAATAAACTAACTGGCTGAGTTTGCGCCCTGGGCAGTGTGTTAGT 

CACCTTTCTGTGATTTTGCTTCGACAAAATACTTGAGAAAAACAACTGAGAAA 

GGATTTAGTTTGGCTGGTGGTTTTAGTCTCCTGGTGGGGTGGGGACTGGCTGC 

ATTGTTTCTTGGCCAGTTGTGGGGGTGGGGGGTGGGGGTGGGGGGAAGACAG 

CCTCGTGACAGAAGGGTGTGGCAGAGCAGACCTGCTCTAGTCAAGGTGGTGG 

GGAAGCAGGAAGCAGAGT^AGGACAGTGGTGAGGCTAAGTATACCCTTTAAA 

GTCACACTTCCCAGTGACCTACTTCCCCATCCAGGCCCCTCCTCCTAAGAGCC 

CAGCTAATTGTGAGTTCATCAGTAGATTCACCCTTGGATTTAGTCAACTAATG 

AGCCAATCACCTCTCAGTAGTGTCATTAGCTTGGGGCCAACTCTGTAACCCCT 

TTTTTGTTGGGGGGGGGGATATTTAATTGAAGAGCTATGGACACTTGATGTCT 
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CCTGGGAGAAGGAGAGTCAGCTATCTATCTATCTAAATATCTATCTGGAGTCA 
AAGGGTTATTTAAAAAAAAAAAAAAACAAAAAACCCAGCCAAACATCATGG 
TGACACATGCCTTTAATCCCAATACTTGGAAGGTAGAGGCAGGCAGATCTCT 
GTGAGTTGGAGACCAGTCTGGTCTACATAGAAAGTTCCTGAACAGTCATGGTT 
5 ACATAGAAAGACCTTGTCTTAAAAAGAAAAATTAGACAGGAAGTTGATGGGT 
GTGGGTTGGGAAGATGGGTGGATCTAAGGAATTAGGGGGAAGACTTGGGGG 
GGATAATTATGATCAAAATGCATTGTATGAACTCTCAAAGAATTAATACGAA 
TATTTTTAAAGTAGCTATAATTCAAGGTTGTCTTGGGTCTGGATACCAAGGGG 
TCCTGGACATTGAGTTGCAGGGCAAGACCACAAGAGACAGGCAGCTGATCAA 
10 CTATGTCCAGAGAGCTGCAGTGTGGCCTGACTTCTGTCAGAGAGAGAGAGAG 
AGAGAGAGACAGACAGACAGACAGACAGACAGACAGACAGAGAGTCTTTAA 
AGGATCTGTAACCTGACACACAGTATGGGGGATCCCAAGATACCCACTAATC 
CCCTTTGTAAGTATCCAAAGATATGCATGGACACTCTCAGTGGGTGGTGGCAT 
GTATCAATGCACTCCAAATTGAAGCAAACCTAAGCTGCTTAGGTTTCTGTCAA 
15 CTTGACACAAGCTAGAGTCATCAGGGAAGAAGGAACCTCAACTGAGAAAATG 
ACACGATTTTTTTTTTCTGGTAGGCAAGTCGGTGGTGCATTTTCTTGATTAAAG 
ATTTAGCTCACTGTGGGTGGTACCACACCTGGACAAGCGGTCCTGGGGTGTGT 
AAGAAAGCTGGCTGATCAAGACAGAACAAGCCAGTAAGCAGTGTTCTTCCAT 
GGCCTCTGTTTCAGTGGCTGGCTGGCTTGAGTTACTGCTGTGACTTCCCTTCCC 
20 GATAAACTGTAAGATGAAATAAACCCTTTCCTCTCCAAGTTGCTTTGAGCCAG 
TGTTTTATCACAGCAAGGGAACCTAGAAAGGGAGTGGTGTAAGATCCAGGAA 
GATGGGGAAGTTAGAGGGTTCAGAAGCAGCAAATAGTAACTGACTAGAGGC 
GCTGCCTACAAGCTGGGCAGGGAGCTCCAGTGACACAGATTTTTGTGAGTTAT 
GGCCCATGTTGGATTAGGCTTTTCAGTGATGTACCTGCCTCTGAGTCCCCCGT 
25 GCTTCTGTGTGTAACTGTAATAAGCTTAGAGGCTGAACTAGACTCACGAGGA 
ATCCTTTCTCTGGTCTGTAATGGGTGGCCTTTCCGAGTGAGAAGATATTTGTTC 
ACACCTTCTCCCCCAGGAGAACTCACACATGTTCACAACCCAGGGATTCTATG 
TCATCTTGATACCCATTTGGCTTTCACAGATGCAAAGCTGGTGTGGCTGAATT 
TTTCTTCGGATGCTCGGCCTCACGGGCCTTGGGTipTCTTGTGCTGGTATTCTC 
30 ATGGCTGCTGCTGCTCCCTTCCCTGCAATTTCAGCTTTGGTGGATTGGCCTGTG 
AGCTCAACTTTGACTATTTCAGGGAGAGTTGTAAGCCTGGGGCGGGGTCGGG 
GGAGAGAAGCATTTGGCTTCTGCTTAGGTTGGTGATAGAATGGGAGTGGCTTT 
ATAAGTCCCTGTGGAAGCTAAGCAG 
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SEQIDNO:4 
Mouse ltrpc5 cDNA sequence 
AAGCATCACGAGCAACAGCCCTGAGCCTCAATTTGGAGGCAGT 
CCTCATGCAGTCCAAGGCACTGACTGCCTGACTGGAAAGCCACCATGCAAAC 

5 AACCCAGAGCTCCTGCCCCGGCAGCCCCCCAGATACTGAGGATGGCTGGGAG 
CCCATCCTATGCAGGGGAGAGATCAACTTCGGAGGGTCTGGGAAGAAGCGAG 
GCAAGTTTGTGAAGGTGCCAAGCAGTGTGGCCCCCTCCGTGCTTTTTGAACTC 
CTGCTCACCGAGTGGCACCTGCCAGCCCCCAACCTGGTGGTGTCCCTGGTGGG 
TGAGGAACGACCTTTGGCTATGAAGTCGTGGCTTCGGGATGTCCTGCGCAAG 

10 GGGCTGGTGAAAGCAGCTCAGAGCACAGGTGCCTGGATCCTGACCAGTGCCC 
TCCACGTGGGCCTGGCCCGCCATGTTGGACAAGCTGTACGTGATCACTCTCTG 
GCTAGCACATCCACCAAGATCCGTGTAGTGGCCATCGGAATGGCCTCTCTGG 
ATCGAATCCTTCACCGTCAACTTCTAGATGGTGTCCACCAAAAGGAGGATACT 
CCCATCCACTACCCAGCAGATGAGGGCAACATTCAGGGACCCCTCTGCCCCCT 

1 5 GGAC AGC AATCTCTCCCACTTCATCCTTGTGGAGTCAGGCGCCCTTGGGAGTG 
GGAACGACGGGCTGACAGAGCTGCAGCTGAGCCTGGAGAAGCACATCTCTCA 
GCAGAGGACAGGTTATGGGGGCACCAGCTGCATCCAGATACCTGTCCTTTGC 
CTGTTGGTCAATGGTGACCCCAACACCCTAGAGAGGATTTCCAGGGCAGTGG 
AGCAGGCTGCCCCATGGCTGATCCTGGCAGGTTCTGGTGGCATTGCTGATGTA 

20 CTCGCTGCCCTGGTGAGCCAGCCTCATCTCCTGGTGCCCCAGGTGGCTGAGAA 
GCAGTTCAGAGAGAAATTCCCCAGCGAGTGTTTCTCTTGGGAAGCCATTGTAC 
ACTGGACAGAGCTGTTACAGAACATTGCTGCACACCCCCACCTCfcTCACAGT 
ATATGACTTCGAGCAGGAGGGTTCGGAGGACCTGGACACTGTCATCCTCAAG 
GCACTTGTGAAAGCCTGCAAGAGCCACAGCCAAGAAGCCCAAGACTACCTAG 

25 ATGAGCTCAAGTTAGCAGTGGCCTGGGATCGCGTGGACATTGCCAAGAGTGA 
AATCTTCAATGGGGACGTGGAATGGAAGTCCTGTGACTTGGAAGAGGTGATG 
ACAGATGCCCTCGTGAGCAACAAGCCTGACTTTGTCCGCCTCTTTGTGGACAG 
CGGTGCTGACATGGCCGAGTTCTTGACCTATGGGCGGCTGCAGCAGCTTTACC 
ATTCTGTGTCCCCCAAGAGCCTCCTCTTTGAACTGCTGCAGCGTAAGCAT14 AG 

30 GAGGGTAGGCTGACACTGGCCGGCCTGGGTGCCCAGGAGGCTCGGGAGCTGC 
CCATTGGTCTGCCTGCCTTCTCACTCCACGAGGTCTCCCGCGTACTCAAAGAC 
TTCCTGCATGACGCCTGCCGTGGCTTCTACCAGGACGGGCGCAGGATGGAGG 
AGAGAGGGCCACCTAAGCGGCCCGCAGGCCAGAAGTGGCTGCCAGACCTCA 
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GTAGGAAGAGTGAAGACCCTTGGAGGGACCTGTTCCTCTGGGCTGTGGTGCA 

gaatcgttatgagatggccacatacttctgggccatgggccgggagggtgtg 
gctgctgctctggctgcctgcaagatcataaaggaaatgtcccacctggaga 
aagaggcagaggtggcccgcaccatgcgtgaggccaagtatgagcagctgg 

5 ccctggatcttttctcagagtgctacggcaacagtgaggaccgtgcctttgcc 
ctgctggtgcgaaggaaccacagctggagcaggaccacgtgcctgcacctgg 
ccactgaagctgatgccaaggccttctttgcccatgacggtgtgcaagcattc 
ctgaccaagatctggtggggagacatggccacaggcacacccatcctacggc 
ttctgggtgccttcacctgcccagccctcatctacacaaacctcatctccttca 

1 o gtgagg atgccccgcag aggatggacctagaagatctgcaggagccagaca 
gcttggatatggaaaagagcttcctatgcagccggggtggccaattggagaa 
gctaacagaggcaccaagggctccaggcgatctaggcccacaagctgccttc 
ctgctcacaaggtggaggaagttctggggcgctcctgtgactgtgttcctggg 
gaatgtggtcatgtacttcgcattcctcttcctgttcacctatgtcctgctggt 

15 ggacttcaggccaccaccccaggggccgtctggatccgaggttaccctctatt 
tctgggtgttcacactggtgctggaggaaatccgacagggcttcttcacagat 
gaggacacgcacctggtgaagaaattcactctgtatgtggaagacaactgga 
acaagtgtgacatggtggccatcttcctgttcattgtgggagtcacctgtaga 
atggtgccctcggtgtttgaggctggcaggaccgttctggccattgacttcat 

.20 ggtgttcacaotcggctcatccacatctttgctattcacaagcagttgggtc 
ctaagatcatcattgtagagcgaatgatgaaggatgtcttctttttcctcttct 
tcctgagcgtatggcttgtggcctatggtgtgaccactcaggccctgctgcat 
ccccatgatggccgtttggagtggattttccgccgtgtgctatacaggcctta 
cctgcagatctttgggcaaatccctctggatgaaattgatgaggctcgtgtga 

25 actgttctcttcaccctctgctgctggaaagctcggcttcctgccctaatctct 
atgccaactggctggtcattctcctgctggttaccttcctgcttgtcactaatg 
tgctgctcatgaaccttctgatcgccatgttcagctacacattccaggtggtg 
caaggcaatgcagacatgttctggaagtttcaacgctaccacctcatcgttga 
ataccatgga^gaccagctctggccccgcccttcatcctgctcagccacctga 

30 gcctggtgctcaagcaggtcttcaggaaggaagcccagcatAagcgacaaca 
tctggagagagacttgcctgaccccttggaccAgaagatcattacctgggaa 
acggttcaaaaggagaacttcctgagtaccatggagaaacggaggagggac 
agcgagggggaggtgctgaggaaaacggcacacagagtggacttgattgcc 
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AAATACATCGGGGGGCTGAGAGAGCAAGAAAAGAGGATCAAGTGTCTGGAA 
TCACAGGCCAACTACTGTATGCTCCTCTTGTCCTCTATGACGGATACACTGGC 
TCCAGGAGGCACCTACTCAAGCTCTCAGAACTGTGGTTGCAGGAGTCAGCCA 
GCCTCTGCTAGAGACAGGGAGTACCTAGAGTCTGGCTTGCCACCCTCTGACAC 
5 CTGAAATGGAGAAACCACTTGCTCTAGAGCCCCAGACCTGGCCACATCGAGT 
TTTTGGGGCACATCAACCTTCCCCCACTCCCAGCAGCCCCAAGAAATGGTCTT 
CAAGGCCTTGCTACAGATCACTTCTTGGACATCCCTTCCTAAGAGAATGAAAC 

TCATGTCAG 
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SEQIDNO:5 
Predicted mouse ltrpc5 amino acid sequence 
(translation from GI9754868) 
MQTTQSSCPGSPPDTEDGWEPILCRGED^GGSGKKRGKFVKVPSSVAPSVLFELL 
• 5 LTEWHLP APNLWSLVGEERPLAMKS\^RDVLRKGLVKAAQSTGAWTLTSALH . 
VGLARHVGQAVRDHSLASTSTKmWAIGMASLDRILHRQLLDGVHQKEDTPIHY 
PADEGNIQGPLCPLDSNLSHFILVESGALGSGNDGLTELQLSLEKHISQQRTGYGG 
TS C I Q IP VLCLL VNGDPOTLERISRAVEQ AAP WLIL AGS GGIAD VLAALVS QPHLL 
WQVAEKQFREKFPSECFSWEAIVHWTELLQNIAAHPHLLTVYDFEQEGSEDLDT 
10 V1LKAL V KACKSHSQE AQD YLDELKL AVA WDRVDIAK^EIFNGD VEWKS CDLEE 
VM TD AL V SNKP DF VRLF VD S G ADMAEFLT YGRLQQLYHS V SPKSLLFELLQRKH 
EEGPXTLAGLGAQQAP^LPIGLPAFSLHEVSRVLKDFLHDACRGFYQDGRRMEE 
RGPPKRPAGQKWLPDLSRKSEDPWRDLFLWAVLQNRYEMATYFWAMGREGVA 
AALAACKIIKEMSHLEKEAEVARTMREAKYEQLALDLFSECYGNSEDRAFALLV 
1 5 RRNHS WSRTTCLHLATEAD AKAFFAHDGVQAFLTBOWWGDMATGTPILRLLGA 
FTCPALIYTNUSFSEDAPQRMDLEDLQEPDSLDMEKSFLCSRGGQLEKLTEAPRA 
PGDLGPQAAFLLTRWRKFWGAPVTVFLGNVVMYFAFLFLFTYVLLVDFRPPPQG 
PSGSEVTLYFWWTLVLEEmQGFFTDEDTHLVKXFTLYVEDNWNKCDMVAIFLF 

rVGVTCRMWSWEAGRTVLATOFMVFTLRLIHIFA^ 
20 FLFFLSVWLVAYGVTTQALLHPHDGRLEWIFRRVLYRPYLQIFGQPLDEIDEARV 
NCSLHPLLLESSASCPNLYANWLVILLLVTFLLVTNVLLMI^LIAMFSYTFQW 
GNADMFWKFQRYHLIVEYHGRPALAPPFILLSHLSLVLKQWRKEAQHKRQHLE 
RDLPDPLDQKnTWEWQKENFLSTMEKRRRDSEGEVLRKTAHRVDLIAKYIGGL 
REQEKRIKCLESQANYCMLLLSSMTDTLAPGGTYSSSQNCGCRSQPASARDREYL 

25 ESGLPPSDT 
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SEQIDNO:6 

Human genomic squence of the region that contains ltrpc5 
gi|3687269|gb|AC0036"93.1|AC003693 Human Chromosome l"lpl5.5 PAC clone 
pDJ91 5fl containing KvLQTl gene, complete sequence [Homo sapiens] 
CTAAAAGTGCACTTCTAAGGACGCGGCTTCGGTGTTTCCCATGCCGCTGCTTG 
CCCCTGGGAAGCGTTGGCTCTGCCTCGGAAGAAGTTAGCGCCAAGATGGCAG 
CCTGGGGTCTTTGGGGCCCAGAAGAAACACTGGCCCCGGGGAGTTCAGTCAT 
CAGGGACTTAGGATGTGGGGGCTTTTCAAACAGCTTTATTTAGACGTGATTGA 
CACACAGTAAATACAGATGTTTAAGGGTACAACTTGGTAAGTTTTGACAAATT 
TATACCCCCGTGAAACCATCACCAACTCCCCAGGTGCCCCTGGGGCCCTTGGG 
ATCTCTGCTTCCTGCCCCTCCTCCCCGTCCCAGGGCAACCACGGGCCGTCGCT 
GTGGGTGCACACAGCATGCATTTCTTCAACAAGCGGACTCAGAAGGCACTTG 
CACATCGTTGCTGTTCTGCCTCTTTGCTTCAGCATGATTACCCAGAGGCGCAC 
CCGTGCCGTGGCCTGCCCGTCGTCTATGCACCCGTGCTGTGGCGTGCCCGTCG 
TCTGTGTGGCATGCCTGTCTGTGCACCCGTGCTGTGGCGTGCCCGTCGTCTGT 
GTGGCATGCCTGTCTGTGCACCCGTGCTGTGGCGTGCCCGTCGTCTGTGCACC 
CGTGCTGTGGTGTGCCCGTCGTCTGTGCACCCGTGCCGTGGCGTGCCCGTCGT 
CTGTGCACCCGTGCTGTGGTGTGCCCTTCGTCTGTTCCTTTTATTGCCGGGCAG 
GGTTGCACCCACATGTGCAAGCCAGCGACGGACCCCAGGTTCACCCGTTCAC 
CGGTCAGTGGGCATATGGGTTGTTTCAGTTTGGGGCATTTACAAGAAACGTGC 
TAGAACATTTGTGTACAAGTCTTGTGTGAACCTAAGTTCATTTCTCTTGGGTA 
AATACCTGTGCGTGGAGCAGCTGGGTCATGTGGTGAATGTGGGTTTCACTGCT 
TAAGCAGCAGTTTTACATAACTGCCAAACTGTTATTCAAGGTGGCTGGACCGT 
TTTACAGCCCCCGTTGTATGCGTCCCAGTTGCCTCCCCCAGCAGCATGTGGTG 
TGGTTGGTCTTTTTCGTGGCAGCCAGTCCACTGGGTGCGCTCGGCATGTGGCT 
GCAGCTTGACCTGGGTTTCCTGGTCCCTGGCAAGGTGGAGCATCTCTTCATGT 
GC1TTTTTGCTGTGTGTGGATCTTGCGGGGAAGGGTCTGTTCCTGTTTTTTGCC 
CATCTTTCAAAGATTGGGTTGCCAGTTTTCTTGCTGTTGAGTTTGGAAAGCTCT 
GCATACGTTCAGGGCACAGGTCCTTTACCAGGCTCTGCCCCAGGTCTTTCGGA 
GAGCAGGTGTCTTTCGCATTCCTGACTCTGGGGAACCTCTAGCCCTGCCACAT 
GGGGTTTGTTATGGGGCAGGGGCACCTGTGCCTTTCCCACCACGGGGCTTGGG 
GATTTGGTGCTGCCATTGCCCTCCCTCGTAGGTGGCCCTAGGGGGGTCCCTCC 
GCCTCCGTTTCCTCATCCAGAAACCGGCAGTGACCATCACCACCATTGTTGTC 



WO 02/054069 



154 



PCTAJS01/49808 



ACCTAGCTCCAGCTCAAGGTCCCTGCTGAAGGTCGGAGAGCTTGGCATGGCC 
CCGTTTGTCCATGCTAGGGCTGGGAAGACCAAGGCTCAGGTGAGGCCTCTGC 
CCAGTGCCTGGCACTCCTTCTTGCCCCATTTTTCCACCCAGGGTGGCTCCCGA 
CTACTTCTGGTAGCCTCGGGGACAGTTGAGGTGGACAGGCTGGCGTCACCCC 

5 CATTTCCGGCTGTCCCTCCCACCCCCTCCTGGCCCAGCTGTTCTGCCCTATTAA 
AAGTCACATGGGCCCTCGGGTCCTTCCTGGTGTTGGCCCAGGCTCTTTCAGGC 
CCTGCAGGCCAGGACCAGCCTTCCCTGCAACCCTCGGCAGAGGCCTGGGGCC 
GGGGCTTGTCTAGGGGCAGCCTCCCCATACGGCCCTGGAGTCTGAACAGAAG 
CCCCTTCCCAGAGCACAGCAAGAAGCTGCAACGTGGCCTGAAGTCCCACCAT 

10 TAGCAGGTTTGGGGTTTAGGCTGAGCTTTGCCATCACTACCTTTCTGTTAGGA 
CGGTATGCCCATTAGATGGGATCATCCCCTCAGCGCCCAGGCTAGAGGAGGG 
GTGGTCCCTGCCCAGCCAGGGAGGGCTGGGGGTGGATGGGGCTCTACAGAGC 
AGCTTCCGAGCCAGGCACGGTTCCATGATCAGCTCTGTTTTATAGAGGGGGAC 
ACTGAGGAACCGGGAGCCTGGGGACCTTCCAGTGGCCCCACAGCTCCTGTGG 

15 CTGAGTCAGGGTTTGTCACCAGGCCTCTGTGGGGATGAGGCTCCCCCATCCAC 
CTGCCCCACTCTGTCCTGGAACAGCTCTCAAAACGGTCTCTGGACCACAGTTT 
CAAAAGAAAATAAGCAATGTTTTCAAAGGCCCTGGAGGAAGCCAGAGTTACC 
ACGGCAACTCTCGGCCTCGCCACCTCCTCCCGCCAGGCTGCATCTGGAGCCAG 
CTCAGGAGGGCAGCAGGGTGAGGACAGCCAGGCTCTCTGGGGCCACCCCCCA 

20 GCCCCCACCCTTCCTGCCTCTCCTGCACTGTCCACGGCCCTCCCTGTGCTCCCA 
CGGGTATAATGGGCACAGAAGAACCAGGAGCTGTCTGCCCCTGCAGGATTCT 
GGAAGCCAGGGGCCCCTGGCCTCCCTGGGGCCTTGTCATGTGAGGGGCACAC 
GTGGGGTCCCAGCTGCCACATGGCTTCCAGCGCTGCCCGCAGGTGTATGTTGG 
GCCCTTGGTGACTCTAATGCACCTTCCACTCGGCACAGAAGAGCTTCAGTCTG 

25 GGGCCTGGGCGGGGGAAGTAGGCTGCCATCCTCGCTAAACCAAAGTGTGAAA 
ATTGAGTTGAAACTCCCATAGGAGGGCAGGAGGCACAGCTCCTCAGAAGAAG 
GTCTGAGAAACCACAGCCCAGGTTGTTGTTTCGGGTGTGTGGAGAAGGTGCT 
CTGGCAGTCCTGCTACAGGGGGACCATCAACAGCCCCTTTGGGGTGAGAGCC 
CCGTGGCTGCTGGCACCAGCAGCCCCTATGAGGCTTATTTTATTTTTGAGACA 

30 GGGTCTTGCTCTGTCACCGAGGCTGGAGTGCAGTGGCACAATCATAACTCACT 
GTAGCCTCAACCTCCTGAGCTCAAGCGATCCTCCTGCCTCAGCCTCCAAAGGT 
GCTGGGATTACAGGCGCTTGCTACCACGCCCAGCCCCCTCTGGCCTTATTGTT 
TGCCAGGCCCAGCTCAGGTCCCGGAGGAGGGGAGACAGGAGTGTGAGGGAA 
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AGGGGGAAGAGGTATAGAGCCCCCAGCTCCTCCACCCACCCGAACCCTCACC 

GAGGCCCTAGACCCTAGACCGGCCTGACCGGGGGGTCCTCAGGCCGGGGACT 

TGGGTGCAGGCCATGGTGCTGGGGCCTGAAGCTCAGGCTCTGCTGAGCACAG 

CCCCCTGCCCAACCCCACCCTGGGGCCCTGCTTCCCTGGCCAGGGCCATTGGA 

ACAGGAGTGGGGCTGTCCAGGTGGTGTTCTTGGGTCCAGCCCTCAGTTTCTCT 

TCTGCAGTTGACCGGCAGCCCTGCATCTGTGGTGGGGTCGGCGCCTGGTGCTG 

GTGAGGCAAGGCCTCAGCTGCTGGGACAGGACCTGCCTGGCACCCAGCTGGT 

GGCAGAGCCAAGCATTCCGACTCAGCTCTGGGAGCAGCTGCCTTCTGGGCTG 

GCATTCTCCGCCAGGGGGGTTGTGCCCTCGTGGCCCCCCCCGGGTGCCTCCTC 

ACCTGGCTGATTTCATCTCCTGTCCCCCTGCCTCCTCCTCCAGGAAGCCCCCA 

GGGCCTGGCCCTCCTTGAGAGTGGCATGGAGGAGGAAGAAGACTCGCCCAGG 

CCCATGGGAGTCGGATGGTGGCCGCACTTGTGGGGCCCTGACCCCATAGGCT 

TCTTCAGCACGCCCTGGCCTGGGTGATCCCTGCCTGAGGGCTGTGCACGGCTC 

ATCTGCCAGACCAGATTTTAGGGGATTCTTGTACTGTCCTCCTGGAGCAGCAG 

GGGGTAAAGCCTGACCCACCCAGAGTGTCCAGCAACAAGGGCCTCCTGCTGT 

GGGCCAGGGACCCTGGAACTGACCAATTGTGTCCTAGGGACGCAGAGTCCCC 

AGGCTGCTAGAGGGCTGTGGGGCCCTGTTTCATGCCTGAAGCAGGAAGAAAC . 

CCCAGGAGAGGTCTGAAGGGGACCCAGCCCCCACCCTGTCTAGCAGGGAGGA 

GCCTCTGCAAGAGGCCGAGGGGTGCTGAAGTGGAGGAGGATAGAGGCAGCA. 

GGACTCAGGGTCACTGGTCATTTATGGGGATCACACGGCTGCAGTGTGCCCTG 

CATGGTGCTAGGCACCAGGGACAGCAGAGGACAAGCCTGTGTCCTCTCCCAC 

CACCAGAGGGCTGGGCACTGCCCCTAGGGAGAGAGGGGGCCTTGGTGTGTGC' 

AGAGGGGGGCCTGGGGCACGTGCCTGGGCTGGTCAGATGATCAGAGTGGGCT 

GGGCTGGGCCTGGTCTGGGGCCCAGTCTCAAGGGCAGACCCCACCTGGCTAG 

AGTTGATTGTGTGCACACCGGATGACCCGGCGTTGAAGGCCTCTCCTCTCTGT 

GAGCCTCATCCCCACCTGCCAGACTCCCAGCACAGCCTGCTTCCTGCCCCAGC 

TGCTGAGCGACAGCGCTGGGCCGGCTTCTGCGCGCCCCTTCCCCCAGCCCATC 

TTGGAAACCACAGCAGCGTCCTTCCTCCCAAGTCCCTTCCCAGGGCTGACATC 

CCACAGCAGGGATGTATCCCACAAACCCCGCA;iGCCCTGGTGCCTACAGCTT 

GGCCTGGTAACATCAAATCCTACCCTCTCCTCCTGGCAGCAAAGATGGGGTGC 

CCCCACCCCAGAGTTCTCAGCACCCCCAGACAGAAGCAGTCCCCCAGCGACC 

TCAGAACTCTTGGGGCGCTGCCACACCCTTGCAGGAGGGGGCAGTGTTCCTG 

GGATGCTCAGGTCCTGGTATCACCTCTGGCCAGATACGGAAGGTGAAACTAC 
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AGGGCATCCAATTCACCTTGAACTTCAGATAAACACCAGATTATTTTTTTGTA 
TGTCCCGTGCAATATTTGGGACACACTTACCCTAAAGAAGTATTCTGTTTTCA 
TCTGAGAGGCAGATTTAACCGGCGTCCCGTGTCTTCCTGGCAGTCCTGCCCTG 
GAGTCACACTCCACAGGTGCAGGGCAGGGCCAGGCTCCAAGTAGATGGCGGC 
5 CAAAGCACCCGCCCCATGCTCCTGACTCCCGGGGCTCTTCAGGGCATTGCGAA 
AACCAGCAGCAGAGCTGACACCTGGTCCCTGCTCGGGAGCCAGCAAGGCAGG 
AGGCTGCTTAGGCCTTGCGTGTGGGGTGGGCGCACTCCCTGCTGCAGTGCTCT 
TCGTACATGTGACACTGTTCCCGCTCTTTCCCAGCTGGCTGGAGGCGTGATCC 
TGGGTGTGGCCCTGTGGCTCCGCCATGACCCGCAGACCACCAACCTCCTGTAT 
10 CTGGAGCTGGGAGACAAGCCCGCGCCCAACACCTTCTATGTAGGTGAGTGCA 
CATGTGGCCGCAGACGCATTCAGGGAGGGCTTCTAGGAGGAGGCAGGTCCTA 
GCCTTTTGGATGGGGACATGGAGGGTGAAAGACAGTCGGGCATGGCGTGTCC 
GGGCAGGGAGGCGGCCCTGGAAAGGGCTCTGGGCACAAGGGTTGAGATGGA 
GGTGGGCCTGTGGCCTGCTGGCCCTTCTGGTCTGAGCCAGGGCAGGGGGTGG 
15 CAGCTAGGCCTGGGCAGGGACTGTGTGGAGACCTTGCTTATTTTAAGTGTGGG 
GTTATTTCGGGGGAGGCTCCCTGAGAAGGGTGGGGCTGGATGCCTGGGCCAC 
ACAGAGCAGCCGAGGCAGCTGGCGCTGTGGAGCCCGGGAGGGAGGGAGGGA 
TGGAGCTCAAGGGATGGAACCCAGTGAGGGGTGGAGACGGGGCAGGGGAGG 
GGTGGAGAGGGGTGGAGACGCCCCAGAGGCGGTGTGACTCAGCTGCCCCTGG 
20 AGGCAGCTGCACCTTGCTGCCTTATTAGGCTGCGTGTGGGGGACTGGGCTGCC 
CTCCCTGCCCCCAGGAGCAGGAGCAGGAGTGATGGAGGAGGAGGAGGGGAG 
GGGCAAGGCCAGGAGGAGGAGGAGGGCCATCTCACTGTGCAGAGAGCAGCA 
CCCTTCCTCCTGGTGCCCCTGGCAGGGCTGGTGCTGGTGGGGCTCTGGGAGCA 
TTTGTTGAGATGCTTCTGGCCTTGAAAGGAGGCCCCTGGGATGGCTCTGTTGC 
25 CCTCACAGGCTGAGGGGTGGGTGAGGTGGGCAGCCTGTGTGTCCCCAGTCCT 
CAGGGCTTCCCTCAGCCGGCAGGTGCCCCCAGGCCTGGAGCTGCAGGGCCAG 
GCCCCCTGCCAGTTACGGAGGCTGCTTGGCTTGGTTGCTGAACCAGGGCCCCA 
GGAGGCCGAAATAGCCCCACACCTGCGCCGTCCCACCTCTTTGTCCAGTCACC 
CCAGGGCCAGGTGAGGGCCCTGGCCACACAGCGTGCCCGTTCCTTCnr'CCCCA 

30 tgccccgctcatgggtcagagggccggtgct'ggggtccagatggtgtcaaca 

GGGATGGTCCCTGTCCTCCCCAGAGACAGAAGCCTGTGGCCCACGGAGGGTT 
' TCTGGGCCCAGCCGATCCTAGGGAGGGTCCCATGGCCCTGCCCATAGGTTCCT 
GGCCTCTCTCGGGGCCGTGGTGCCCTCACAGGTGGTGTCAGGAAGGACGGGA 
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AAGGCTGCTTGTCCCAGGGGCTCATGTGGAGACCACCCCCTGCACGCAGCTG 
GGGCGCTCCTGCCTGTGTCCTCAGAAGCACTCGGCTTAGCTTTGCCCATGTGC 
CTGGGCTGTGGGTGGCAGAGCCCGGCCAGCATCCTCCGATCTCCAAGGGTGC 
ATCTCTACTGGAGGCCCCTCCTGGGCCTCTTGCTCCCCGCTTCCCAGATCATTA 
5 GGATATTTGGGGTCCAGAAGGGCCTCCCAGCCATCCTGGGCCTTGTCCTCCGG 
GGCCACCAGTCCAGCCAGTGACAACCACAGGATCCCCGGCCTGGAACGAGGC 
TGCCCCCAGCACGTTCCTCGTACTCCTGTGCAGGGACAGGAGGGGCTGCCCCT 
GCCACCGAGTCCCCTTCTCCAGGACCTGGGGCCTGTGGGTGTGAGGCAGGTG 
TTCTTGGAAGGGGTCACTCTCCAGGCACCCGGCGGCCAAGGCTTGTGGCTGG 

1 0 AGCAGCTCCCGCTGTGGGGTCGGCGTCGGGCCCCGTGTGGCCGGAGAGGAGC 
TGAAGGGTCACTTAGCTTCGGGCTGGGGGGAGGACAGGGGACACCCCAGAGA 
GGTATGCCAGGCCTCCTTCCTGCGCCCCACTCTCGGCAGAAGCAGAGGTCAC 
AGGCTGTGCTGAGGCCCCATGGTGCTGCCCCCATGATGCCAGGGTGAGGCTG 
GCGTTGGAAGCAGGTGTCTGACCTGCATGGTGTCACCGTGGCCACATCAGAG 

1 5 CTCC AGCCCC AGAGCCGCCCACCCTCGGTCCTTGGCTGTGGTTTCCCTGGGCT 
GGAGGAGCCTGCCGTTGTGTTGGCCACACGACCACAGGACCTGCCACCCCCG 
ACGTGGGCTCTGCCTGGGCCCCCACTGGACAGGGACCCCTTGGAGCTCCTCTG 
GCCACCAAGTCCTCGCCCATTCCAGAATCGGCCTTCTGGAGCCTCTTGCTGTC 
CCTGATGCGGGCTGGGCCTTGCCAAGGGCTTTTTTTCCTGCGCCGGGAACAGG 

20 GTGGATTTGCTGGGCTCACTCCCCTCAGAGACGCTGCGGGTGCGGTGGGTTAG 
GCCCAAGGGCGTTAAGAGAGGAGGCTGGGGTGGGGCTGGGGCCTGGCAGGG 
GGTCTGGCAGCCCTGGGCCTCCCACCTCCTGTCAGGACCAAAAAAGGCAACG 
CGCCTCTCCTGACCTGTACCCCGGAGTGAACCCAACCTTGCAACCCAGGAGTG 
TCAGGGCCTGAGGGGAGGGAGACCTGGCTCCTGGGTGCCGTGCCCGTAAGGA 

25 GGTGGCCACCTGCAGGGCATTCCTGGCAGAGGCTTCATCTGGCCAGGTAGGA 
GGCTGGGTGGCCGAGCCCCAAATCTGGGTGTGTTCTCTGCCTGGCGGTGGGTC 
CTGCCCCAGGCACCTTCTCCTCTGGGCTGGCTGGGCAGGGACAATGGGCCTG 
GCTGCGAGGAGGGGGCCTGGGCTGCCTTCTGCATTGCCTCGGTGACGGGAGA 
TGGCCCCil r GCCTGCTGAGGGATAGGGGAGTGGGCAGGCAGTGAGAGACACTG 

30 ACAGCTGTCCCGCGGGTACAGGGCCCTGTCTGGGTGGCCAGGCCCATGTCTC 
GGGCCCACAGTGCGCCCCCCACCCTTGGACGGCGCCTTCTCCCTCCCCAGGTG 
CATGCTGCCCAGCCAGGGAGCGTGGGGGAGTTCGGGAGGGCTGGCCTACACG 
CCCTGGTCCAGCTGTCCCAGGTGGGGTGCTGGGCTTCAGCCCTCAGCCCAGGG 
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CCTAGGAATCCAACTTGATCCTCCCCACACAGCAGCCAGGTTCAAATGCAGG 
TCCCGTAACGGAAGTGCTGCTGTGCAGCCCAGATTGGGGGGCAGGAGCCAGC 
AGGGCCCCCCCACCCTCTTCTCGCACCACACTGGGGAGGCAGCATTGGTTCCA 
GTTCCGGTTCCTGGGCTGCCCTCTCAACCCCGGCCTACAGTGGGGCCCACCCT 
5 GTGCCTTCTGATGCCACTCCCACCCCACGCCAAGTCCCAGAGGCTTTGGGAGC' 
GGGTGAAGGCGGTGGGTGGCGGGTGGCAGGTGCAGGCGGTGGGTGGTGGGT 
GTGGCAGGTGGCGGGCCCCACCGCAGGTGTCATCCCTGCGAAGCACCTGTCG 
CCAGCACTCAGAGCGCTCATGAGGTGCCCAGTCCCCATGTGGCCTCCTTAGTC 
TCCGTCCTGTGTCATGGAAGAGGTAACTGAGGCACAGAAAACTCACCAGGCC 
10 AGGCTGGGATGTGAGGTCCCTTGCTGCTCATCCCTGGCAGTCAGCAACCCTAC 
ATCTTCCCAGCTGGGCGGCCCGTGGTGGGTTCGGCACCCAGGACCCTCCGGG 
GTCTTGGGCTGTGGCGAGTGTGTAGGCACCCACCTGGTGTCTCTCTCCCCGCA 
AGGCATCTACATCCTCATCGCTGTGGGCGCTGTCATGATGTTCGTTGGCTTCC 
TGGGCTGCTACGGGGCCATCCAGGAATCCCAGTGCCTGCTGGGGACGGTAAG 
15 GCAGGGAGGCGGGCCTGTGCCTGGGCCGGGGAGGGGCTGGGGGCTGCGTCTG 
GCCCTGAGGAGGGGGCAGAGCTGGTGCTCAGGGCGGAGCCTAGAATTCTGGG 
GGAGGTGGCTCCTGTGCCCTGCITrTCCCGTTTGGTTTTTAAATTAAATCCCAC 
CGTGCTTGGTCTCCATCGTGGCCAGTTCCTACGTGACCGCTTTTCTTTGTCAAA 
AAATAGCCACAAATATAACAGGGAGCAAGCCTCAGCTCTGAGGCCAGCCTCG 
20 GCGTCCCGGGCACACCGCCCCCTGTGGGAAGCCCAGGCCTGGCTGTGCCATC 
CAGGGCCTGGCCAGTCCAGGAAGAGGGAGCCTATGCCCGTGTCTCCAGTGGG 
GGAAACTGAGGCAGATCCCATGGCTCCCCCTTCCGTGGGGAGCAGGAACAAG 
GGGGTGGGGAAGATCAGTCAGGGGTCATGCTGCTGCACACGCCTCCCTGGGG 
GCTGCAGACATCCTGGACTCACCAGCCTGTGACCCCAAACCACACGCCCCGC 
25 CCCATCCACCCCGTCCTGTGGAGCCTGGTGCCGCGTGGGGACATCCTGGGCTT 
TGACGGCTCCTCCCTGCGCTGAGTTTTAGCCTCTGTGCCCCAGGGCTCCACAC 
AAGCCGCTCACTCCTGGTCAGGTCGTGGGCTGGTGGCTCCCACTAGCCCCTCA 
CAGACACGCCTGCTGGGCACCTGGGTGTGTGTCCTTGGGCCCCGCCTACAGCC 
TGCCCTCTTTCCTCCCTCTGGCii ACTGCCCGGCTCCAGTTCTTCACCTGCCTGG 
30 TCATCCTGTTTGCCTGTGAGGTGGCCGCCGGCATCTGGGGCTTTGTCAACAAG 
GACCAGGTGAGCCTGGGTGTGCAGGGACAGGGTGGGGTGGGTGACGGGGGC 
ACCCTCCTCTCCTGTCGCGGGTGGGGGTTGGGCTGACTCATGGCTTGTGGGAG 
CTCTTTGGGCTCTTCCTGGGTCCCACTTGCCAGGAGGATCTCCAGGGGCTTTA 
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TGGAGGAGGCAGCATTGGGGCTGAGCACCAGGCCAGCCTCCCGTGTCCCAGC 

ACTCCCGGGGCAGCTGAGAGTGCAGAGTCCTTGTCCTCTGGGGTCTAGCCTCG 

AAGCCACCCTGCCCAGGGAGAGCCTGGGAAAAGTGCGTCCGCCTGGGGCGGG 

GCGGGGTGGGGGCAAGGAGGGGGAGGTTCCCCCTGTGCATGTGACCGCACCC 

CTCCCCCAGATCGCCAAGGATGTGAAGCAGTTCTATGACCAGGCCCTACAGC 

AGGCCGTGGTGGATGATGACGCCAACAACGCCAAGGCTGTGGTGAAGACCTT 

CCACGAGACGGTGCGGCCCCGGGGGGCGAGGGCGGGGAGCAGGGCCCCGGG 

AACCCGGCGGGGTGTGTCTCGTCCTGGATGAATCCTGCCTACGCCCAGACCTC 

AGGAGCAGGAGGTGCCCTTGGGACCTCCAGGACCCCTGGTCTCAACTGGTCC 

TCGGGTGGGAACCTAGTGGGCCAGGGTGGCCCAGGGTGCGGAAAGCTCTGAG 

CAGCGCAGCTGAGGAGGAAGAAGGCTGGCCCCTGGATGCATTCTGCAGTGGG 

GAGCGCTGCGTACCCCTGGCCACCTCCCCATGGGTTCCCTAGAGCCACCGTCC 

CCCTGGGCACATCCAGGGCTGACCTTGCACCCCTGCTCTCTGCAGCTTGACTG 

CTGTGGCTCCAGCACACTGACTGCTTTGACCACCTCAGTGCTCAAGAACAATT 

TGTGTCCCTCGGGCAGCAACATCATCAGCAACCTCTTCAAGGTGCGCGAGGC 

CGGTGGGGCCGCGCCTGACCCCCCGCATGTCCCGCCCCTGGGTGGGGTCCTA 

GGGGTGGGCAGGTCACACGGCAGCCCCACAGGGAGCGACCACACTGGGTGG 

CATGGCCCCTGTCAGGGCTGCTCTGCTGGGAGGGTTGGGGTGGGACCGCATC 

TGGCCCACGAGGAAGGCAGGCGCCCTGTGCTGCGCATTCCGGGTGAAGAAGG 

TGGAGGCTCTGGGGGGTGGGAACTCACCTGCACCCCCAGCTCCACGTGTGCA 

CTCGTGGGTGTGGACGCCCCTGACAGCCTGTAGCTGGCAGGGCCTGCAGGCC 

ATATAGTGCCCTGTGGAAGTTTCCTGCTGAGGCCTCAGTGGAAGTCGTCATCA 

GTGATGCTTTAGGGGTCTAGTGACACCAATGACCGTGATCTCAGTGGAAAAG 

GGCACAGTGTGTCCCAGGCATTTCGCGTTTATGTTAAAACGGGTGGAAGATA 

GCAAGCCGGCAGAGGCCGGGCCGCTGCACCCGCCTGTTCCGAGGTGGGTAGG 

GGGTGGGGGGCTGTTCCCAGGATTCCCCTCTACGCTTTCTGTGGTGACCACGG 

ATTACTGCGTGACAACGGGAAGCCGGGAGCCGAGGCCCGGTCCCTGACCACG 

CGTGCCTGGCCACCCCTGCAGGAGGACTGCCACCAGAAGATCGATGACCTCT 

TCTCCGGGAAGCTGTACCTCATCGGCATTGCTGCCATrGTGGTCGCTGTGATC 

ATGGTGAGCGGGCGGGGGCGGAGGGCGTGCTCTCTGGGCTGCCCCTTCCGCG 

GGGCCTTGTGCTGACTGCGCCCCCCACCACCCTCCTGCAGATCTTCGAGATGA 

TCCTGAGCATGGTGCTGTGCTGTGGCATCCGGAACAGCTCCGTGTACTGAGGC 

CCCGCAGCTCTGGCCACAGGGACCTCTGCAGTGCCCCCTAAGTGACCCGGAC 
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ACTTCCGAGGGGGCCATCACCGCCTGTGTATATAACGTTTCCGGTATTACTGT 
GCTACACGTAGCCTTTTTACTTTTGGGGTTTTGTTTT^ 

TACCTTTTCAGGGCTGACGTCACATGTAGGTGGCGTGTATGAGTGGAGACGG 
GCCTGGGTCTTGGGGACTGGAGGGCAGGGGTCCTTCTGCCCTGGGGTCCCAG 
5 GGTGCTCTGCCTGCTCAGCCAGGCCTCTCCTGGGAGCCACTCGCCCAGAGACT 

cagcttggccaacttggggggctgtgtccacccagcccgcccgtcctgtgggc 

TGCACAGCTCACCTTGTTCCCTCCTGCCCCGGTTCGAGAGCCGAGTCTGTGGG 

cactctctgccttcatgcacctgtcctttctaacacgtcgccttcaactgtaat 
cacaacatcctgactccgtcatttaataaagaaggaacatcaggcatgctac 

10 caggcctgtgcagtccctcagtgccagtggtgtctgagacctaggggttggcc 
ggagggcaggggaatctgacatcggtggggcttggctctgtggactctgtgg 
ggtccagggtgagggtgggtgggtcgggatccctggtgttcaccaaaggagt 
cactctgtaaaatttggggagttatttattctgagccaaatatgagcaccggt 
ggcctgtgacacagccccaggtcctgagaacttgtgcccaaggcggtctggc 

15 tacttaattgtatacattttagggacataggacattgatcattacatctaaga 

TGTACGTTGGTTTAGTCGGAAAGGTGGGACGATTTGAAGGGGAGGGACTTTC 
AGGTCATAGGCGGATTAAAAGATGTTCTGATTAATAATTGGTTGATTTTATCT 
AAAGACCTGAAATCAATAGAATGGACTATCTGGGTTAAGAGGAGTTGTGGAG 
ACCAAGATTATTATGCAGATGAAGCCGCCAGATTGTAAATGTTTCTTATCAGA 

20 CTTAAAAAGGTACCAGAATCTTAGTTAATTCTCTCCTGGATCAGGAAATAGAC 
CTGGAAAGGGAGGGGGATTCTCTATAGAATGTAGATTTTCCCAAGAGACAGC 
TTTGCAGGGCCATTTCAAAATACATCAGAGAAATATATTTTGGGGTAAAATAC 
TTCGGTTTCTTTCAGGGCCTGCTGTCACGTTGGTATCTTATTACTACAGAGTCT 
GTTTTGTGAGTCTTAAGGTCTTTTTATTTTTAGACAGAGTTT^ 

25 CCAGGTTGGAGTGCAATGGCGTGATCTCAGCTCACTGCAGCCTCCCCTCCACC 
TCCCAGGTTCAAGCGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGACAACA 
GGCATGCACCACCCCACCCAGCTAATTTTGTATTTTTAGTAGAGACGGTGTTT 
CGCCACGGTGGCCAGGCTAGTCTCGAACTCCTGACCTCACGTGACACACCAG 

v gttttgggattacaggtgtgagccaccacaccggactaaggtctgtgtttta;a 
30 tgtgaatgctggtcagctgtgcctatgaggcatgttcggccacccacagtcat 
catggcctcaacgagcttttcaggtttactttagaatgcatttggccaagagg 
tgcccattcagttggttggggttgcttagaattttactttgggtttaaaccag 
ggagcaactccaggtagcaagggccctttttgggagcgttctctctattctct 
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TTTGGGAGAGGCCCTGTGTTGCCTGCAGCCACTTCCACCCTGCCCCTTGGGCA 

CACAAGGGGCACACAGTGTAAGCAGGTGGGCAGGAGGGGTCGGGCAGCCAG 

GGAATGCAGTGAGATGGGCTTGGGGTAGGGGCTGGGTGCGCTGCAGGACTCC 

TCTTCCTCCTGAGGGATGGTAAAGGATGGACACACTGCCCCCTCCCGAGCATT 

TGAGGGTCTCTGCCCTGCCCATCTGTTACCTGTAAATGTTCCTTTGAGGAGCT 

GATGGCTCAGGCCTGAGCCACATCTCAGAGGGTCTGGAGGGGAAGAAAGACC 

TCATCCTACTAGGGAGCCCCCCCAGCCCACCAGCGAGCGGTGGTTGGGGGCA 

GACAGGCTGTGGGGCTAAGGAGCCCCTGCACTCCCCCGTCCTTTTCCCTTTGT 

CTGAGCACCTCCAGCCAGTGGGCTTGGTCTAGACTCTCCTATCTTTCCCCACA 

TCGTGGGGTGGGGCTTGCTCTGGGTTAGGCTACTTTTCCCTAGTTGTGGGGAG 

GGGGGTGCTGGCACATTTCACTGTTCCCTGGAGGAAATGAGTGCCTGGGAAT 

TCATATCTAGGGCTCCCAGCAGCCTCTTTGCAGGCCAATTTGGAAACTGTCCC 

CAGCCCTGCATTTTAGGGGGTTACAGAGTCTCTCAGCAGGCCCTCCTCCCCTG 

CTGCTCCCAACTTGCAAGCCTGCACTGGTTGGGAGAACATAATGGTCCAAGG 

AGCCCCCTCTCTACTTTCCGCTGTGTTCCCTGTGGGGAGGGAAGAGCAGTTTA 

AGAAATAAGGAATCCCAAAGGCGCACAGCAGACCGGGGGCCGAGGAGTGGG 

TCCTGCTTCCCCTCCTTTTTTCTAGGCTGAGCCACAGCAGGTCCTTGAATCCTA 

TTTCCCAGCGGATGCCAGGACAGCAGGCCCTGGGGGAGTTCTCTCTCGAGCCT 

TTCAGAGGGACCAGAGGTCTAGCAGCCAAGGAGAACTCAGAATCCTTGAGTG 

TGTGGGGCAGGAACTCTCCCAGCTGAGAAGGGGCACAAGGTGCCAACCATCT 

AGGGCCCAGTGGCCAAGGAAGACGCGGCTTGTCGCAGGGAGAATCTGGGCCC 

TGGTCCTCCCTTTCAGGGCGGGCAGCTGACCTGCCCCCTGCTGCGGACAGGCG 

AGGCCAGGCTGCTGGCTCGCAAGCATGGCGGAGCCCAAACCTTCCCTGCTGC 

CGCCCGCCCAGCCACGGCTGACTTGGAAGCTTGAGGAGCGTTCAGCAGCCTC 

CATCCTGCCCGGGAGGACCGGGGACCTGGAAGGGCCTGGCCCTCGCTTCCCT 

GCAGCGCCCTAGGGGGACGTCTCAGTGCCTCCCGGAGCCCGGACCAATGCAC 

CAGAGCTGAGGGCCCAAGGGTGTGAGGGTGGCCGGGCAGTGGCCCCGAGGA 

CGGCGCCCCACAAGTTTGCGGCCAGGGCCCAGCAAACCCCTAGGGGTGGGAA 

AGCGTCGGCCC^:iCTAGCGGGTCCAGCAGGGCTGCCCCCTTCACCGTGGCCC 

AGCGGTCACGACCCCACGTCCTCATCGCGGGCTGGGACTGCCTCTGCGTCTGG 

CCTGAGCGGGACCGTGGGATCCTGGGGAGCCCCGCCTCGGTGCACTGACAGA 

GCCCAGAAGGAGTGACGGTTACCGCTTCCGGTCAGGACCGGAAGTGCCGGGA 

ACGGCATTCGTCCTCCGTGCGAGATGACGCACTTCCTGCCTGAGGCGGCCGCT 
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gttctcgcggcttccggcaggtggcgctgagaccacgggaagccagcctggc 
tgtcggttagccctcgagcattctgggaattgcaggcctggcccctcctcttc 
ctgttcttggtcaattccggtcttgtttccccaacaaatgccgtcgtttccggg 
gctgcttccgagccggacccaagggccggggcgtggaggagtagaggggcg 

5 agcgcatgcgcacaggactacacgtcccgacaggcgtcgggagcggcggccc 
agttccttgtgggagctgtagttctgcaggcgcggaagccgtggtgctcggcg 
ggcagagcactcggtttcccagagggctgagcgcgccgcacggaggtgcggc 
gccgaccaagatggagactgccgagcagccttgagccggtaggtttgtggtg 
agggaggacgggccgcgcgggccggccgagcctccgggaggtcaccgagcg 

1 0 cagcttt aatacctg agctcgaaggccccgctgtgctcgccgacccccgtacc 
tcgcggccggggccttgggagccacagcatccttgtgaggcccggaggcctg 
tccagcccgactggacagtgccgaggggcaccgagagccagcttggcaccga 
gagttcgtttgttctctggcggggaggtcttgctggcacatatagtggagaaa 
ggccgggctctgcgttcatgtggagaaagagacggcttccttcagcctacgg 

1 5 ac atgaagg agtcaactctaccttccactcgttgccggctttcgccgagaacc 
ccgagaaacggactaccggagtccctatcttgcagcccgatccccgctaccc 
gtcggagtgccccgctgaccaggctgcttctggccgcggcggcgttccgctgc 
agaggacgggagtgcgaatctgggaagcagggttctggttgaactccagctt 
cgtctgcaacatactgtgtgacttgggcaaattatttcccccgccccgttcct 

20 gccagctttaaaacggtcatcagtggggggtgctgcgtatcccctttcactgg 
ggtggcttcttgactgaggagagtcgcgcctcagaggaactga'ggtcctgcc 
tgtgttcgacgtggtggggggcactaagagcccctgatagtacccctgacccc 
atccttattgggtgcacaagacacaggtcactctgggcgggcaaggagtttt 
ggtagcaggagaggagtcggtggatggatggctgaggacagtgcagaaggg 

25 tgtggctgggccgtctttttttgcctggaaattcaagttctgaggcacccagt 
'cactccagcactaaatgggtgcaggaggcagcacttgtctgcccagctggaa 
aggcagggtatgtgctgagtgttacaggtggaaggccactggaggtcgctcc 
aggagccgcggggatttacctctgcctaacagggctgctcaaggtgatggtc 
gacaccccactttcctgagagcttgacqctcagatgccagggccttggctgca 

30 gattccttgggagctcccggggatcttccagcaaataggagcaaatcttttcc 
ccgtggatcaggaaggtgcacgctctttgtggaatacgactgctcaccccgc 
acagcaagcagcttataagtggccctcctgcctgatttcagccctgggttcaa 
gccctgggtggctgcttactaccaaaatcgctcagtagctccaagcctgcctg 
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CAGAGGGTTGGCACCATTAAATGAGGTAACGAGTCAAAAGTCCCTACCCTGG 

GTCCTAGCCTGTCAGGGGCTCCGAAAACCCAGGCTCAGGTCGGTCCTGCCCG 

GCACCTGTTTCACACATGTACACTCCGGTCTGAGGTTGGTGCTCTCCCCCACC 

CCACCCACCTGCAGTTGAGCAGCTGAACAGAGGCCATGCCGGGGCACTCCGA 

GGCCTGAGACGACCACGCCTGTGCCGCTGAGGACCTTCATCAGGGCTCCGTC 

CACTTGGCCCGCTTGGCTGTCCAATCACACTCCAGTGTCAACCACTGGCACCC 

AGCAGCCAAGAGAGGTGAGAGGAGGGCTTGGAGGGGGAGGCGGGACTCCAC 

CCTGTGTGGGACAGTTCTGTCAGTTGACCCTCCACTTGTCCAGGGGCAGTGGA 

TCTGCAGGGGGAACTCATTCTCAATACTGTTCCTCCTGAGAAACAAATTTTCT 

GGGCTGTTTTGGTTTAGGTGTGGCGTGGCCCTGGGGACGCATGGCTGAGGCA 

GGAACAGGTGAGCCGTCCCCCAGCGTGGAGGGCGAACACGGGACGGAGTAT 

GACACGCTGCCTTCCGACACAGTCTCCCTCAGTGACTCGGACTCTGACCTCAG 

CTTGCCCGGTGGTGCTGAAGTGGAAGCACTGTCCCCGATGGGGCTGCCTGGG 

GAGGAGGATTCAGGTCCTGATGAGCCGCCCTCACCCCCGTCAGGCCTCCTCCC 

AGCCACGGTGCAGCCATTCCATCTGAGAGGCATGAGCTCCACCTTCTCCCAGC 

GCAGCCGTGACATCTTTGACTGCCTGGAGGGGGCGGCCAGACGGGCTCCATC 

CTCTGTGGCCCACACCAGCATGAGTGACAACGGAGGCTTCAAGCGGCCCCTA 

GCGCCCTCAGGCCGGTCTCCAGTGGAAGGCCTGGGCAGGGCCCATCGGAGCC 

CTGCCTCACCAAGGGTGCCTCCGGTCCCCGACTACGTGGCACACCCCGAGCG 

CrGGACCAAGTACAGCCTGGAAGATGTGACCGAGGTCAGCGAGCAGAGCAAT 

CAGGCCACCGCCCTGGCCTTCCTGGGCTCCCAGAGCCTGGCTGCCCCCACTGA 

CTGCGTGTCCTCCTTCAACCAGGATCCCTCCAGCTGTGGGGAGGGGAGGGTC 

ATCTTCACCAAACCAGTCCGAGGGGTCGAAGCCAGACACGAGAGGAAGAGG 

GTCCTGGGGAAGGTGGGAGAGCCAGGCAGGGGCGGCCTTGGGAATCCTGCCA 

CAGACAGGGGCGAGGGCCCTGTGGAGCTGGCCCATCTGGCCGGGCCCGGGAG 

CCCAGAGGCTGAGGAGTGGGGCAGCCACCATGGAGGCCTGCAGGAGGTGGA 

GGCACTGTCAGGGTCTGTCCACAGTGGGTCTGTGCCAGGTCTCCCGCCGGTGG 

AAACTGTTGGCTTCCATGGCAGCAGGAAGCGGAGTCGAGACCACTTCCGGAA 

CAAGAGCAGCAGCCCCGAGGACCCAGGTGCTGAGGTCTGAC.^\GGGAGATGG 

CCCAGCCTGACCCCACTGGCCACTGCCATCCTGCTGCCTTCCCAGTGGGGCTG 

GTCAGGGGGCAGCCTGGCCACTGCCTAGCTGGAATGGGAGGAAGCCTGCAGG 

TGGCACCGGTGGCCCTGGCTGCAGTTCTGGGCAGCATCCTCCCAAGCAGAGA 

CCTTGCTGAAGCTCCTGGGGTGTGGGGTGTGGGCTGGAAGCACTGGCTCCCTG 
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GTAGGGACAATAAAGGTTTTGGGTCTTTCTGAGACTTTGTGTCTATCTGGGCC 
CTGCTTACCCAAAGGGCTCAGTTGGCAGCAAGAGCTCCCCACACCTGACCCTC 
GGTGCCGGACCACTCGAGGGTGGCTGACACCTGCATCCCTCACCAGCACATC 
ACCCAGGTGACAGTGAGAATTGGAAACCCCAGGCCTCCTCTAGGGCTTGTGG 
5 CTCAGTGGCAGGTGTCCAGTGAGTGCCCTCAATGGGCCTGAGTGGGTACAGA 
ATCTGCCCTCCCCCAACCAAAGCCCACATGATGCCATCAGCCCCAGGCCTAGT 
GCAGACCACAGCTTGGGAAGCGAAAGGGAGATGACAAAATGGGTGTGGACA 
GAGGAGGCTGGGGTGAGTGGGCAGTTGGGGGGCCTGGGCAGGTGTTTGAGCC 
TGGGGTCCAGTTTTATCTGGGCCACGTGGGGTACAGTGTTTTCCAGGCTCCAC 
10 TGAAGATAAAGCAGCCACTGGTGCCAGCAGGGGCTTTGTCTTGGGGCCATAA 
ACACCAGTGTGTGCTGTACTGGCCATCGGGTGTCAGGGGGACAGCTGACAGA 
TGCAGTGGATGCCCAGCCCTGTCTTCCAGGAGCCAGTCACCTGAGCCCCAAA 
GGACCTTTTGGGTCTTAAGTGACAGAAATTCTAACCACCAGTGTTTCAAAAAA 
AGATGAGTCTCTTGTCAGGCAACACAAGTCAGGGCAGGGGTGGGTGCAGACC 
1 5 CC AG AGC AGCTTCGCAGTCTGGATGGGGGTGCTCCCCCGTCACGCCACAGGG 
CCAGCTTTTCCACAGTGCAGGCTGGTCAGTTTCCAGGTAGGGATGCGGTGGG 
GCACGTTTTCTGCCCGGGGAGTTGTGCCTGTCAGGGGAGACAGCCCCATGAC 
ACTGCTGTGCCTCCGGGGAGGCCAGGAGGGACAAGGAGCGGTTCCTTTGGGT 
GAGGGTCTGTGGTGAGGCACCAGGAGGGCCATTGTCTGCTGGGGGGCTGGTG 
20 GCCCCTGGGGGGTTCCGCTGGAGGTCGGCAAAGGTCAGTGCACAACGTGCAG 
GGTGGCCAGCTGAGGAGAGGTGGCCGCACACGTGGCAGGCCAAGCAGCTCA 
GGTGTCCGAGGGAGGCTGGCCAGGCCCGGGTTGTTCCCAGCCAtCTAAACCA 
CCTCTGTGGTCAGCAGCCACCAGCTGGCTTCCCTCGCCACAGTGCTGAGAGCC 
TGCTGGGAAGGAAGGCCCCCCTGAGCGGTGGGAACCAGCAAGGCAGGCCCA 
25 GGAAGGGGAGAGGTAGCTGGCTCAGAGGCAAGGGCCAGGACGCCGTGGCTC 
TCAGAGTGAGTCCCCACAGGCCAGCCTGGGGAAGATGGGGAGCCCAGAGAC 
AACCAGGCCTCTGCCTGCACTCCTGGGACTAGGAAAGCAGCCCCCTTGAGGC 
CGAGGGGCCCGTTTGCTTGCTGAGCAGGACAGTCCAATAGGGCCCACGCCAG 
GCA'.jAGCCTCAAGAGTCTGAGTGAAGCACATTCGGCCCTGGCCAAGGCAGGG 
30 ACTCCGTCAGAGGGCACCCACACCACCCTGAAGAGCCGCGTCCAGCCAAGGC 
CTCCTGAGACTCCCGGGCCAGGCAGGGAAGGCAGAGCACACAGCACAGCGTA 
CGGCCCCCCAGCCGCTGCAGAGTGGGTGCCCATGCCCACCCTCATCCCACGCT 
CCCCAAACAGGCTTACTCCGGGGGCCGCCACCCTGGGCCAGCACGTCAGCCA 
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CGGAGGACACGAGCACCGAGCAGTAGTTGATCTGGAGAAGGGAAACACGTC 

CGGGAAGCTCCAGGGCTCTCTCAGGACAGCAGGGGACAGGCAGCCTCCCCAT 

CTCCCCTCTCCTGCCAGGGCCCCCGCTGCCCTGTGACTCCTCCCTCTGAGAGC 

TGCCGCTCACAGGCAGGGGTGAGTGAGGCTCCCGTGAGGCGTGTAGAGTGAA 

AGGCGAGGGGTCTTCAATGTGAGACCCCTCATTTAAGGCTGTAGCCTCCCAA 

GTGGCAGGGTGGGAGGAGGCACAGGTGGGGACGGCCTGTCTTCCCCAGAACG 

CTCCTCTGCCCACTGGGGTGCTCCTGTCCTGGCTCTGGCCTGCCTCATCTCTGT 

GAGTGACCGGGCAGGGCTGTGGACCCATCCCAGTAGCCCCACGCAGCCACCC 

GCCTCCCATCAGGGAGAGGAGCTCTGGGGCTCGCTTGCCTGTGACTCCAGAC 

ACTTGATGCGGTTTTGTTGCTCTCTCAGACCCCCGAGGTACTTGGCAATGAAG 

TCCACTCTGCGGCAGGAGCAGGTGGTCAGGCTGTGGATGGCCACGCGGTGCT 

CTGCGTGTGGGGAGCCTCCCCATCCCCCCAGGCCTCCCTGTGTGCCCGAGTTT 

GGGGTGCCCTGGCCCTTTCCCTTCCTCCCTCATGCCCAGATCTGGTGGCACCA 

GGACCCCTCAAAGTGCACCTGGTGCAGTACAGAGCGCTCTATGCCTTCCCAGT 

AGGGCTAGGGTGCCATCCCTGCCTGGGCGCACCACCCAGAAAGACGGTGCAG 

GGCAGGCAGTCGGGGACAGGCCCCTGGAGCCAGGGCTGGAGGGAAGCCAGG 

GGTGGGTAGTCCCAACCCTGAGACCACAGCAGGGAGGGCGCCACGGTCACTG 

TGCACGCTCCCTTTTTGCCTGGCCAGGCGTCCCAGCAGGGTCCTGCACAGGTC 

AGGCTGGAAAGCTAGGCCCCTGGGCTCGGCCAGAGCCCTGCCCCTACCCCTT 

CAGCTTCAGTTCGCCAGCTCAGATCCTCTGTCACTGGCGCCAATGGCACATCC 

CCGCTTAAAGACAGCCCGATACCCACCAGTGATCAGGACAGGCCCCCGCACC 

TGTGGGGGGTTTTCCGCAGCAGCTCCCCCTCGCTGTCCCTCCTCCGCTTCTCCA 

TCTTGCrCAGGAAGTTCTCCTTCTGGACTGTCTCCCAGGTGACGACCTTCTGGT 

CCAGGGGGTCTGGCAGGTCTCTCTCTGGGAAAGCCAGGTGGCAGCCAGCATT 

ACTAGAGCATGTGGGCGTCAGTGGCAGAGCCCAGGCCTGGTCCCGGGGCGAG 

GTGGGCCAGGCAGAAGGAGTGAGTGGGGCCCAGCCAGGGATGGAGGGCAAG 

CATGGCCCTGTGCAAGGACGGCCAAGCTGCGGGTGGACGGGCTCAGCTGTGC 

CAGCCTGCACAGAGCCCAGCTGAGGACCCACAGGGCTGAGGAGGGGTCCAG 

GTCTCTCCAGACCTGCC'tCCAGCGCACAGCCCCGCCCTGGGACCCGCCACCTC 

GGCCTCACCCAGGTGCTCCCGCTTGTGCTCGGCCTCACCCAGGTGCTCCCGCT 

TGTGCTCAGCCTCCTTCTTGAAGACCCGGCGGAGCGTCAGGCTCAGGTGGCTG 

AGCAGGATGAAGGGCGGGGCCAGGGCGGGGCGCTCGTGGTACTCCACAATC 

AGGTTGTAGCGCTGGAACTTCCAGAACATGTCTGCGTTGCCCTGCACCACCTG 
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GAACGTGTAGCTGCAGGGGCACAGCTGAGCCGTCACCTACCGGCTCCCCACG 
ACCACTTGGGGGACGCATCCCCCTGCACCCTGATTGCTGTTGGGGAGCCCTTT 
TTGAAA'CTGAGGCCCAGCACTGCTTCTGCGTTGCCTCTGGGGTGTGTCACGAG 
GGGTCCACTCTCCTTGGGTCTTCCCCTGTCCACTCTGCTCTCCTGGACAAGCCC 

5 ■ AGAGTTCTTGCCACAGCACCGGTCCCCCAAAGACCCAGGCCCTGCTTCCTGGG 
ACTGGGAGGGTGGCCCACAGAGTTCCCGGTCTTGGGCAGGCAAAGCCCTGCA 
GTCCAGGCTATGCCACCTTCTATCTGCCCTGAGGCACCAAGCCTCACCCTCTG 
CAGCACACCAGGTGGGTTGCAAAGGAGGCGGTGTTGGGGAATGACCCTCTGC 
TCCCTCCGCTCCAGGGCTCTCTCCTCCAGGGGTTGGGTGGAGTCACCTGAACA 

1 0 TGGCGATGAGCAGGTTC ATGAGCAGCAC ATTGGTGACCAAC AGGAAGGTGAC 
CAGCAGGAGGATGACCAGCCAGiTGGCATAGAGGCTGGGGCAGGATGGTGA 
GTCCTCCAGCAGCAGTGGGTGGGTGGAGCAGTTCACACGGGCTTCTGGAAAG 
CAAGGGTGCTGTGGGGACCAGACCGGGGGCAGCCACAAGGGCGGCCTTAGG 
CAAATGCCCCGAGATAGAGCGCTGAGTCCTGGTGTTGAACCCAGGGGACGGG 

1 5 GTGCTGGTTGGGTGACCC ACTGTCCCAGTTCACCC AGGGCGGCCTGGGTTTTA 
GCAGTGAAAGCCCCATCACAGCAAACCAGAACCAGACGGGTGGTGAGCCCC 
ACCCTGGGCCCCCTCTGCCATGTCTGCTGAGCTTCCACCAGGGGCTGGAGTCA 
GCCAAGATCACAGGCCCTGGAAGTAGAGCCAAGACCCCTGGAGGGATGGAA 
GGTGGGGGACAGAGGGGTCACGGGACCCAGGTTTCTCCTGGTTGTTGGATGT 

20 GTTAAAACCTCCACCCCTCCTCCGAAAAAAGCCTCTGCTGTTGAAGGTTTGGG 
GAGAGCTGCCGAGGAAAGGGAGATGGGGCCCAGGACTTCTCAGAGCCTCTAC 
AGCACTAATGCCTGTGATTAAACCCCAAGCTGGCAAGTAATTCCGCCACCTTT 
CCAAAATGTATTGAGCCAGCAGGCTCCAGACTGGGCTTGCTGCAAGCTCTGA 
GGGCACCGATCCCGCACTGGGCCACTCAAGTCAAGAACATTCTTCAGGCGGC 

25 ACACTCTGGGAGCTTGCAGGCTGAGCTCCCTCGGCCCTGGGCAGGGACACCC 
TGGAGGGGTGGCAGATCTCACTCCAGTCCCTTAAAACCAGACCCTCACACCT 
GCTTCTCCCTTGTAGGGCTCAACCCTGGACCCTCACCCCACCCCAGCCTGCGG 
GGCAGGAGGAGGGAGGAGGGAGGAAGGAGCCGCAGTGGGC.GTCAGGGTGCT 
CAGGGTCCCAGCCAGAGGCCAGAGTGTGCCOSjGCCGTCCAGGACCACATAT 

30 CCCCTCCATATCCGAATGGGAGCTCCGCTCTCTCCTCCCTCCCCAGCCTGCAC 
GCTGGCGTGTGTCTGTGTGAGCCTGTACATGTGATTGGATGTGCCTGGCACAC 
ATGCTCATGTGTGAGTCTGCGTGTGAGCACACTGGCACACACGGGCACATTC 
AAGTGTATCTTAGTGCCTGCCAGCTCCAGGTTGCCCTGCGGTGGGTTCTGGGG 
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GGTCTCCTTGTTCCCAGGTTCCTGCCTCTGTGCCTGGGTCCTGAGCTGGAGGG 
GTCCGTTCTCCCTCCTAGGGTGAATTCCTGCCTTGTTTCTGCCGAGTGAGTTAG 
TGTGACGCGTTGGCCCGTGTGTGAGTTATTGAGTGTCTGTGTGTGCACTGGCA 
GTCAGTGAGCGCCAGCCCTCCCCACGCTCCCACCCTAGGCCTGCCCCTCCTCT 
5 GTCCGCCCTGGCCACCTCCAGGCCATGAATCTGTTCCTGGGCAGACTGATGAC 
CCCCTCTCCTCCCTGCACTCTGGAGGAGCGGGCCGGGGCCAGCCTGGAGAGC 
CTGTTTGTGAAGCCGTAAATCTGTCTGATAAAGGCAGCTCTGAGGAATGAGA 
CCCCCACACCCTCCCCGACTCTGTGATTTGGGAAGGCGCCACCAGGCCGGGCT 
GGAAATTAAATTGTCCCTGGGGCCTGGAGACCCAGAGAACAAGCTCGTCACC 
10 ACAGCTCAACCCCAGCTCCGGCTCGGCACGAGTGCAGAGTGGGGCAGGGGTC 
TGTTCTGGGCTCTGAGGAGACCTGCAGGCTGTACAGGAACTGTCTGAGCCTCA. 
GTCTCCCTCTCAGCCCAGCATGTGTTAGTGAATAACTCCCTGCCTAGCAGCAT 
GAGGGGCTGGGTCACAGTGAGTGGCTGGGGTTCAACCTGAAGGGCCCAGATA 
CTTGCTCGGCAGCTGAACGGAGCCCAAGGAATTGCAGCTCAGGGAGGGGAAG 
15 TGACAAGTCTGAGACTCTCAGTGGGCGGCCGGGGCTGGCCCTGTTGGAGGCG 
TTTCAGCACGGGGCTCTGAGATCCCCGCAGCACCGCAGACCTCCCTGAGGCA 
GCACTCGGAGCAGGGGCCCCTCGGGAGTCTGTTTCCCTGCCTTCTCCCGTCGA 
GAAGGAGGCTATGGCCAGGCCTGTCTCCTCATACGAGTCGCTGAGACTGGCC 
CACCAGAGATGCAGCCACTCAAGAAAGAGGGGACCCCCAACCCTGCCCTGGC 
20 TGGAGGCGCCCGTCTAGAGCTCCTCCTGCCCCGAGGGGCCCTGCGGACGGGC 
AGTGCTGGAGGCCGTATGGTGAGGGCGAGGTGGGGTCGCAGGCAGGCAGGG 
TCTGGTTAAACTAATGGCCTGGAAAATGAATGCCTGTGATGAGCAAACACCA 
AAATGTCAGGAGGGGCTGCGGGGGCTGAGAAAACAGCCCAAGGAAATGAGC 
CGCGCTGGGAGAGGCGGTTTCCATCACGGCGTCGCCCCCGCCTGTCCCCGGA 
25 GGCCTGTGTTTGCTCCGGTCTGGCCGTCATCCCCTCAACAAATATTTATCXjAG 
GGTCCTGAGCGCCACTCCACAGAGGTCGGGGTCAGGCCTGGGCAGCACGTTG . 
TGCCATGACTGCCCCGTGCCGGGCAGCCCCAGCCGCCTGACACCGCCTGAGC 
AGTTTCCCTGCACCTTCTCTGCCGGGTCTCCAAGTCTGCGGCACCCCTCGAGG 
GCCCCAGGGGCGGGCTGGCTGCTGGTCCACCTTCCCAGAGGGGTGO-TCCCTT 
30 TCCAGGTTGGGGCTCCTCACTGAGTCCCAGCCATCCCAGGGGGCTGCACAGGT 
CCCTGCCCCACCCCAGAGCTCCCCACCAACCCATCACTACAGAGGCCCCCATG 
GCCAAGCGGCTCTGACCCCCGCCCCCTCCCCACTGCACTGGCTTATGAAAAAG 
TTTCCAGAGCCTCCACCAGCCCCAGCCAGCTACAGGGGTCGGCCTGGGGACA 
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CCCATAGGCATGGGCAGGCCACACACACACATTGGGGTGGGTCCCCAGGAGC 
AGCCAGGGTGCCTTTAGGAGCGGCCAAATGGCGAAGGGGCCTTGCCAGCAGG 
ACGGCGTAGGGGCTCTGGTGGGTGCCCTAGGCCAAGCAGGGCCCTGTGGGAG 
GGCAGAGGCCAGGGGCTGTGAGGAGAGCTGGAGGGCCCAGACCCCGGATGA 

5 TGGCTGGGACAGCCTTTGGGCCCACTCTGTGCCCTTAGTGCCTGCCTATCCCT- 
TCCCCACCGATAGACCACCCCCAGAGCAAGACTTGCTGCTTGGCCTCCCCAAG 
CAGGAGGCAGCCCTGGCTGCCTTGGGGTCTCCTTGTGTCTCCTTCCCAGCCCA 
AGGCCCCAGGTCCCAGGTCGGGGCTCCGTGACTGGAGCTGCCTCTACCTCAG 
GAACCAGGCCTGGCCCTTTGGGGTGTGGGGGGCCTGGGGACCAGGCCTCAGC 

10 CACTTCTCCCATCTCTGCTCCAGGCTGGGGTCTCCATGGCCCCAACAGACCTC 
TCTGGAGAGCCTCATGCCCAGGGCITGTGGACACAGGGCTCCAATTTCTCCCT 
GCCCCTGCCCCCGCTGGCTGTGTGCAAACCATCAATCTCGTCCAGTGGGATCT 
GGCCGAAGATCTGCAGGTAGGGCCGGTAGAGCACCCGGCGGAAGATCCACTC 
CAGGCGGCCGTCATGGGGGTGCAGCAGCGCCTGGGTGGTGACACCGTAGGCC 

15 ACGAGCCACACGCTCAGAAAGAAGAGGAAGAAGAAGACGTCCTTCATCTCCA 

cggggcaggggcagagagagggacgtcagcggccagccgggtggggcccag 
gcaggggattgctgtccctccagggccagggccagggcccccgggggctcac 
catgcgctctaccacgatgatcttggggcccagctgcttgtgtatggcaaaga 
tatggatcagccgcagcgtgaacaccatgaagtccatggcgaggactgtgcg 
20 gccagcctcaaacgccgacggcagcatcctggaggatgggaggctgatgcgg 
ctgcggggcccagagaggggcagaggcttccccaggggcacacagcatgctg 
gctgcgggcagggccagggccaggccaggactccttgggccctgagtggctt 

catcttttgtttatttattttt^ 

ggagtgcagtggcatgatcacggcttgctgctgcctcaacttccctggctcaa 
25 gtgacccacccacctcagcctcctgagtagctgggaccacaagtgccaccgc 
cacacccaacctgagtggcttcatctggaagcctgcccacaagccgccctga 
ggccacacggcctctgggccccacagacctgcaggtgacacccacgatgaac 

AGGAAGATGGCCACCATGTCACACTTGTTCCAGTTGtCCCCCACATACAGTGT 
GAACTTCTl'CACCAGGTGTGTGTCCTCGTCTGTGAAGAAGCCCTGGGAGGGA 
30 GGTGGGCAGTCCACTGACAGCTGTCCAGCCGCCCTCGCTGGGGACAGCCCTA 
CCTGGACAAGCTTGGATCTGTAAGGATCAGGGTTCCATCTATGACCAGGGCC 
GAGGCTACCAGTGGGACTAGGGGTCCACTGAAGCTATTCCAGGGCTGTCTGG 
GGACAGAAGGTGAGGTCAGTCTCCACTTGGGGCCAGGGCCTCTTTGTGCCCA 
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GGGTCTAGGGATCAGTTTATGCAGTGTCTGGACTCAGTCCGTGATACAGGTCA 

GGGTTCAGACTCTGACGAGGGAGACCATGGTCATGCTCAGTCTGGAGCAGGG 

CTCAGGGCTCGGTCTTTGACCAGGATCATGGCCGCTGGTGACTGGGAAGATG 

GGAGGCCCCCATGCTGTTCTTGTTTTACAGTTGGGGAGCCCCACCCTGCGAGG 

GCAGGACCCAGCTTAGGTTGCCCAACCTGAAGCTGCAGAGTGGAGGGGACCT 

AGGCTAGTGTGGCCACCGACCTGCCGGATTTCCTCCAGCACCAGCGTAAAGA 

CCCAGAAGTAGAGGGTGACCTCGGGCCCTGAGGGGCCCTGGGGGGGCGGCCT 

GAAGTCCACCAGCAGGACGTAGGTGAACAGGAAGAGGAAGGCGAAGTACAT 

GACCACGTTCCCCAGGAACACAGTCACGGGAGCGCCCCAGAATTTCCGCCAG 

CGTGTGAGCAGGAAGACAGCACGTGGGCCTCGGTCACCCTGAGCCCTCGGCG 

CCTCCACCAGCTCCTCCACCCTGTGCCGCAGAGAAGTTCGCAGTGGTGAGGCT 

GGCGCCCAGCCGGGTGCCCCACCAGAGTCCAGCCTCCCAGCCACCACCCGCC 

AGTCCCCTCCACCCTGCCTGGCCCTGTGCCTCGCACCGGCTCTGCAGGCCATA 

CAGCGGGCTCTTCTCCGTGTCCAGGCTGTCCAGGTCCTGCAGGTCCTCCAGGC 

CTGTCCTCAGGGGAGCTTCCTCACTGCGAGCACAGGAGAGCTCAGGGCCCGC 

AGGAAGGGCTCCCAGAGGCGCCTGCCTGGCTCCAGGCGCTTGGCCATCAAAG 

TGCCCACTGGGATGCAGGGGCTGTGGGGAGTGGGACCCGCAGGGAGGCTGG 

ACTTGGGGGCTACAGAGTCAGGCTACCAGCAGCACCTGCGAGGCCCAGGCCC 

TCCCCGTAGCTCCGGCACTCGCACTGCTGCCTGTCCTGGCCGGGCAGCTCACA 

GGCCTCACCCACCTGAAGGTGATGAGGTTGGTATAGACGAGGGCGGGGCAGA 

GGAAGGCTCCTAGCAGCCGCAGGATGGGCGTGCCTGCGGCCATGTCCCCCCA 

CCAGATCCTGGTCAGGAAGGCCTGAGGTGAAGGCAAAACTGTCGGCTCCAAC 

TCTGCACCTTTGCCCAGGCTGCGCCCTGCCCCAGGAATGCCCTTCCCCCAGGT 

GCCTGGCCCACGTGAGCTGAAGGGGTGCCCAGCCCAGCCCTCCCTCCCGGAG 

CCCCCGTCCTGGTGGGCATGTGCTCCTTGGGCCACTCATCCCAGGGGTGTGGG 

TTGAGCTGCTGTACCCTTCACTAGGGAAAAGATTGGGCTTGGGGGTTGGGGA 

GGACTGAGAGGAGGAAGTGCCACTTCCACTGAGGCCAGAGGCAGGAGGAGT 

GTGCCTGGTGGAGGAAGCCAGGTGGGCGCCTTCCAGGAACAGAAGCCCCACC 

CCGCCCCCTCTGTGCTGAGGACCpGAGTGACAGGGCAGCTGCAGGCTGGGAA 

GTCAACTCAAGCTGGGCCCAGCTCGCCCGCCCACCCCACCCCCTGGCAGCTCT 

CCTCGAGGACAGCTGGCACTCACCTGAACGCCGTCGTGGGCAAAGAAGGCCT 

TGGCGTCAGCCTCGGTGGCCAGGTGCAGGCAGGTGGTCTTGCTCCAGCAGCG 

GTTCCGGCGCACCAGCAGGGCGAAGGCGCGGGCCTCACTGTTGCTGTAGCAC 
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TCGGAGAAGAGGTCTGCCCCCGGAGGCCCTGGCCGCTAGGACGAGACGCCGA 
TGCCCGGCCCGGCCCCCGACGCCCCTCCTCCATCCCCTGCCCAGACCTGGGCT 
CTGCAGCCGCACCCTGCGTTCAACACGCAGGGCCCAGCCAGGCCTTCTGCCCC 
CTCCGGCCTGCTCCGGGCCCTTCCTGAACCTCCGTCCTTCTCCGGCTCTGTCCC 

5 TCCCCCAGCAGGACAAAGGGAACCTCTGCCACGCTAAATGTCCCTCGTCCTGC 
GGCAGGGCCCAGTGTATGCCCGGGGTGTGAGGCTGTATCGTCCTCACCACGT 
GCACAGCGGTCCCACACTCAGGGTCTGGGGCCGCCTCCACCCCAGGCAGAGC 
CGTCTGTCCTCTCCTATGGAGGAGGCCCCTGAGGGCACCtGGAGCCCCACGG 
CGGTAACAGGCAGCCCTGGCGAGGCCCAGGACCCTTCGTCCGACCCCTCCGT 

10 CACATCCTCCCCCGCGCGCGGGCCCGGCCCCAGCCGCCGGTCACTCACCAAG 
GGCCAGCCGCTCGTATTTCGCCTGGGGCGTGGCTGGGGCCGCCtCGGCCTCCG 
TCTCCAGGTGCGACATCTCTTTGAGGATTTTGCAGGCGGCCAGTGCGGCTGCC 
ACACCTTCCTGGCCCTACGAGACCTGGTCTCAGGAGGCCGCCCCTCCCCTGCC 
CCCGCGCTGGGCCCGCGGCCTGGGCTCACCATGGCCCAGAAGTAGGTGGCCA 

15 TCTCGTGGCGGTTCTGCAGCACGGCCCACAGGAACAGGTCCCGCCAGGGGTT 
CTCGCTCTTCTGGTTCAGGTCCAGCAGCCACTTCTGGCCCGTGGGCCGCTTGG 
CCGGGCCCTTCTCCTGGAGGACACGGGCGTCGGCCTCCTGCTGCGGCCCCAGC 
CTCGCCCTCCATCCCCACGGAGCCCCCGCTCACCGCCCTCCTGCGGTCCCCTG 
GCCGGCCGTCCTGGTAGAAGCCTCGGCAGGCGTCCTGCAGGAAGTCCTTGAG 

20 TACGCGGGAGACCTCGTGCAGGGAGAAGGCCGGTGGCCCCGCGGGTGGCTCC 
CGGGCCTGCTGGGTGCCCAGGCCGGCCAGCGTCAGCCGGGCCTCCTCCTGCTT 
CCGGTGCAGCAGGTGGAAGAGCAGGCTCTTGCGTGACACGGAGCGGTAGAGC 
TCCTGCAGCCGCCCATACGTCAGGAAGTCGGCCACGTCTGCGCCGTTGTCCAC 
AAAGAGGCGCACAAACTCGGGCTTGTTGCTGACCAGGGCGTCCACCATGACC 

25 TGCTCCAGGTCACAGGACTTGGCGGCCATGGGCACCCGAGGGAAGGGGGACA 
AGAGAGTGAGCGAGAGACAGGAGGAGGGGGTCCAAGGAAGGAGAGAGCAG 
GGAGCATGGGGATCCATCCCCAGGGCCAGGTCACGCAGACCCTCTCGCTCTC 
CTGCACCCCAGCCTGCCCCCTGCATTGGGGCGGGCACAGAGGGGCCAGTCGT 
GAGCAAATGCTGAGGGTGTCACAACCCCCTTGGGCTTipGGGACAGCCGGCA 

30 CCCAGCCAGCTCTGCCTGGGCCAGTGAGCGAGGCAGAAGCCCGGGCAGAACC 
GTGGCCTGGTCTTGCCCCGGACCTTGGGACAGCTGGGCAGAACGGGGTCAGC 
CAAGCAGCCCACAGGGTGCAGGAGCAGGCAGCGTGGGCAGCTCGGGCAGTG 
CCATGATGGGGAGGTGGGTAGGCAGGGCTGGGAGGCACCTTCCACTCCACGT 
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CCCCATTGAAGATCTCACTCTTGGCGATGTCCACGCGGTCCCAGGCCACGGCC 
AGCTTGAGCTCATCCAGATAGTCCTGAGGCTCCTGGCTGTGGCTCTTGCAGGC 
TGTGGGCAGAGCAGGCAGGCACTGGTGAGGGTGGAGCTGAGGGCCTCACAG 
AGCCGGGGCACAGGGTCCCCAAAGGTGCGCTGCCTAGAGACGGGGAAACGG 

5 GAACTTCACGCTGGGACTTGAGGGGGCACATGCGCCACCTCTGAGCACCAGC 
AGGAGGCACGAGACTTTGTACAAACCGCAGGAGAAAGCGTGCTGCCCTGGTC 
TGCGTCGGGACGGCCCGTCTGCTCCCATGTGAGGCAACTTCCTCGGCTGCAAC 
G^GT^TACCCCAGCTAACACCTGGGCCTCGAGGTI^AUIT'CCTGGCAXrrGAAG 
GTCCAGGACACCCTTCCTGGATGTTGTCGGGGGAGCAAGCTGGGCCACGTTTC 

10 CTCCACGGGCAAGGGCATTTGGCCATGACTTGGTGAGGTCACCAGGGTCACT 
CAAGGGGGGCCTGCAGCACCTGGGCTGGGTGCGCCTGCTGGAAACCGCTGTC 
CTCACCAGACTCGCACGTGAGGCTCACCAGCCTCAGCCTCCACAACCATGGA 
CCCCGTAAGACTACGGTGCCTCTCTCAGGCCACTGGGACCAAACTGAGACTG 
GTGAGAAGAGGCGGGGGAGTTCCTAAACGGGGAGGTTTCTTTGACCTCCTTG 

15 TCCTGGTGAGGTTTCTAAACCCAGAGCCACCCGCAGTCCAGACACCCTTCCCA 
GAGCGTCCCCTCAAGGTGTCCCCAGCCCCTGCTTAGCAGTGCCTCTCCCTGCT 
GGGGCTGTGTCTGCCTGCCAGCGTCCAGCCCTTCAGCACGCGTTCGTCTCCAG 
CAAAAAGCACATCAAATGCTCATCAAAATTTTCCATCAGCGGGTGAACAGAC 
GCATCAACCACGGTCTGTCCACAGGCTGGAACCTGACTCCGCCTTACAAAAG 

20 GAATGAAGCATGGAATGGCATACGCTTCAACACGGAGGAACCGTCAAACCCT 
CCCGCTGCCTGCAGGGAACCATGGACACGAAATGCCCGTCAGCAAATCTATA. 
GAGACAGGAAGCAGATCGATGGTTCCCAGGGGCTGGGGGAAATGGGGAGTG 
AGTGCCAAGGGGTATGGGCTCTTTCTGGGGCGATGAAGGCTTCTGGAATTAG 
ATAAAAGTGATGGTCGCCCCACCTCACGAATATACTAAAAACCACTGAATTG 

25 GCTAGTGCGGTGGCTCACGCCTGTAACCTCAGCACTTTGGGAGGCCGAGGCA 
GGCGGATCACGAGGTCAGGAGATCGAGACCATCCTGGCTAACACGGTGAAAq 
CCCGTCTCTACTAAAAATACAAAAACAAAATTAGCCGGGCATGGTGGGGTGC 
ACCTGTAATCCCAGCTACTTGGAAGGCTGAGGCAGGAGAATCACTTCAACTC 
GGGAGGT«vGAGCTTGCAGTGAGCCGAGAtCGCAGCACTGCACtCCAGCCTGG 

30 GCGACAGTGCGAGACTCCGTCTCAAAAAACCGAAAACCAAAACAAACAAAA 
AACAACCCCCCCACCAAAAAAATACCACTGAATTGAGAGCTTTAAGTATAGT 
TTTAAGTTGTGGTAAAAAACACGTCATAAAACTTGCCAAAfACAGACAGCTT 
CATGAATCATGTGTCATCTTTCCTGGGAAGGGGCAGGCGGCCGAGCGTCACA 
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AACATTGGCCAGGCTGCCAAACCCCAAAGCGGCTCTGAGCATGAAGCCCCCC 
AATGGCGCCTGCCTTGCCCACCCTGCCCGCCCTCACCTTTCACCAGCGCCTTC 
AGGATQACCGTGTCCAGCTCCTCGGAGCCCTCCTGCTGGAAGTCATACACGGT 
GAGCAGGTGCTGGTGTGAGGTGATGTTCTGCAGCTGGGCACCAGAACAGAGC 
5 CCCCATGGGGCCGCCTCAGCCAGGACGGGGGCAGAGGCAGGCCGTGGTAGA 
CACCAGCGTAGGCACAGGCAGCGTCCCCAGGTGAGCCTTGCAGCCTGCATGC 
CCACCGCCCACCGGGCCCGGCCTGGGACCACCCGCAGACCAAGGGGCTGCCT 
GGGTCTCTC TGCTCCCTGTGGGCCTGAAGCGAGAGTGGGTGGGGGGCCCAGC 
AGCCTGAGGTGGGAGGAGTGGGCTGGAGGCTGCATTGCAGGAACTCTCCCAG 
10 AGGACCAGCCCCACCCCCGGAGGAGGGGTCGGGAGGACAGGGGAGGGGGCA 
GCACATACCAGCTTGGTCCAGCGCACGATGTCCTCCCAAGAGAAATGCTTGCT 
GGGGAACTTCTCCTTAAACTGCTTCTCGGCCACCTTGGGCACCAGGAGGTGGG 
GCTGGTTCACTAGGGCAGCAAGCACATCGGCGATGCCCCCCGAGCCTACCAG 
GATCAGCCACGGGGCAGCCTGCTCCACGGCCCTGGAGATCCTCTGAGACGGG 
15 GAGGGAGGGGAGAGCGGACCCCAGATTAGCCCGACGACATCTGGATGCAGG 
TGTCAGGGGTGCCCCCATCCCTTCAGCCCTGTCCCAGGGGTGCCCAGAACCCA 
CCGCACCCCTCCACAAGGCTTCGGCCAGCCAGGGGGCCTCAGCCAGGCCCCT 
ACCTCCAAGGTGTTGGGATCACCATTGACCAGCAAGCAGAGGACAGGGATCT 
CGATGCTGCCAGTGCCTGTGGACAGGGCACCCGTGAGGCCTGAGGACCCTCC 
20 GCCTGGGGTGACCACCCGGATCTCAGGCTCTCAGGCCACCACATTTTCTGGCC 
AAAAGCTGGCTCTTCCGAATAAAGTGTGGGCTTCGTCTGGGCCACGTGACAC . 
AGGCTGCTTATTACAGCCCCTGCCATATGAGGGCCGAGGGCTCCAATGCTAA 
CCTACACTACCCAGCACGCGCACTTTGTGGCAACAGCTGGCATATGGCTGTTA 
AAACACAAAGTGACGTGGAAGAAACGGAGCACCCAGCCCTACAGGTGCACT 
25 GCCACACTGCACGCGCTCATGCTGAGCATGGCAGAGGCAGGATCTCGACCAT 
CGAGGGAGCTCTGCTGGGTGGGCTGCGCCTGACCACTCAGCAAACTGAAGAG 
GTACAGGAGGGAGTCTTCAGGTTTCTCCCCCAAACAGAAGCTGGAAATGAGA 
TGTCTATATGTGAAGCCCCCAGAGAGGCCTGGACCTGCCCCTGTCTTCTGTGA 
GGGAGTGAACTGCCTGGAGGTGv.^CTTAGAATCACCCACAGGGAACTGGGCTC 
30 GCAGGCGGGCCCCTGACTTTGAAAGGTGCCTCCTGATTGAATGTGGAGGATA 
AATCAGGCCTGGCCAAACGGTGCCCGGGCCAGGAGGGCTTCCGCAGAGCAGA 
- GCCAGACCCTCCGCCTGTGTCCCTAGAGTTTGACGAGACTTTGTGTTTTCAAT 
TCTAAAGAGTGCTCAGTTGACCGGGCACAGTGGCTCATGCCTGTAATCCCAGC 
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ACTTTGGGAGGCCAAAGCAGGCAGATCACCTGAGATCAGGAGTTCAAGACCA 

GCCTGGCCAACATGGTGCAACCCCGTCTCTACTGGGAAAAAAAAAAAAAAAA 

AAAGAGCCGGGCATGGTGGCACATGCCTATAATCCCAGCTACCCGGGAGGCT 

GAGGCAGGAGAATTGTTGGAACCTGGGAGGTGGAGGCTGCAGTGAGCCGAG 

ATTGCACTCCTGCACTCCAGCCTGGGGGATGGGGGACAGAGTGAGATTCTGC 

CTCAAAAAAAAAAAAAAAAAACAGAGGGCTCAGTTGGCCTTGCCAAGAAAT 

TACAATTCTGTTTCTAAATATTGTATATTGTTCATAGATGTTATAAATAATTAT 

-ATGGTITGTTTATGAATA^H^ 

TTTTGAACGTGCGACTGGCCAGAGGTGCCCATTTCTCGGGTGACGCTGCCCTC 

TGGCGGCAGCATACCTCACGCCTGGCGGCGGGAAGTAGCCCTCTTTCAACTC 

ACGATCTCATTTTGCCTCTTAGAGTCCCGGAGCCTCTGACCACGGCAAGGCCC 

AGTGAAAACAGAACTGCACCTGCTCTGGTGGGAATGTCACTGTCCAGTCAGA 

GCCACTCTGTCCAGCTGTGCCTCCTTCACCACTCTGCCCAGACAGCTCCTCCTC 

CCAGGCCCCCACGGCCCAGGCCCCCACGGCCCAGGTCTCGGTGCTCAGGCAT 

GCGGGGTGCGTTCTGCCTGGGAAGCCCTCCCCCTTCTCCCTGGCGGTCACCCC 

CACTCTCCCACAGGCGGGTCCCTGGATCCCCACTGGGTCCCAAGGCTTCCCTG 

GGTGGCTGGGGCGGGTCTCACCCCCGTAGCCCGCCCTCTGCTCCGAGATGTGC 

TTCTCCAGCCTCAGCCGCAGCTCCGTCAGCCCATCGCCCTTCCCCGGGGGGCC 

TGGCTCCACCAGGATGAAGTGGGAGAGGTTGCTGTCCAGTGAACAGAGGGGG 

CCCTGGCTGCCGCCGTCATCCTCAGGGTAGTGGACAGGAAAATCCTCCTGCGC 

CCATGCAGGGAGACGAGGCTGAGCCCTCGAGAGGCAGCTTGGGCTGGTCCCG 

CTGCGGGATCCTCCCTGCCAGGCCGTGCACACCACGCCATGGGGCCCCGCAC 

AAGGCTTCTGCCCGAGCCTTGGTTTCCCCACCCTTCAGACAAAAGGAGGGAC 

GAACAGCTGGGGCCCGGGGACCCTCCTTCGACAGTCAGGGGCAGGGGCGGTG 

GGGAGCGGGTTCTTTTAGACAGAACCATGGAGACAGCCCAGGCTGCCGGGAG 

CACCTCCTTCCCGACTCTAGGGCTGGCGTCCAAACATCCACCAGGGGCTGCCC 

GGGCCTCCCTCGGTAGTGCCTTAGACAGACCCCAGCATGGGGGACGGAAGCT 

GCTTGCCCAGGCTTGGAAAGGGCTCGGGGCCTGGGCCCGTGTGGTACAGGAG 

GGGGTGAGGGCCCGCGCCACAGTCTACGCTGGCCGI^CCCTATGCGGTACTG 

CCACCTGCCGGCCGTGTGCTGGCTCTGCCCGCTCCCTGCTCTTCCTCCAGCTCG 

AGTCCTGTCCTCCCAGAGCTCTGCCCGCTGCCCAAACCCTTCTGAGCCCCTGC 

CTCCAGGCCCCCAGCGGTCGTGGTGCTGGGAGCTGGACGTGCACCTGGGCCT 

CCTCCAGAATGCGGCGGTGCAGGACGCGGCCCAGCGAGGCCATGCCGACAGC 
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AACCACACGGACCTTGGTGGACGTGCTGGCCAGCGAGTGGTCGCGCACGGCC 
TGCCCGACATGCCTGGCCAGGCCCACGCGGAGGGCACTGGtCAGGATCCAGG 
CTCCTGTGGGGATGGAAGAGGGCCTCGTTGTGCCGCACGCCCCAGGTCTTCCC 
TAGCTGGCCCCACGCCCTAACCCCTAGGCCCAATCAGCAGCCAGTTACCTGG 

5 GAGCCAGGGGTGGGTGCTGGGGAATGCCTCACGGTGGGGTGGGGAAGCCGA 
GGCCCAGAGAGGGCAAGGCACCCACCCAAGGCTGCTCAGCAGCAGAACTGG 
CTGGGCCCAGAACCTGTGAAGTGGCACCCCCACCCCACCTGGAGCGCACAAA 
CTGGGGTCCAGTCCCCCCACCGTGCTCCCTCCGGGACACATCCGACACAACTC 
TGTCCAGCTGCCTCTGCTGGGTGAGAATCCCAGGATGTTTTCCCAAATGACAA 

10 AGCCCTCGGCCCTTGTCTGGCTGGAAGATCTGTTTGGGTTTGCCACAGTCATT 
TAGTGAGGGATCCTGAAACCTCAAGTCCTTTTCTCCTGGGTGTTGTATAGGCC 
AAGCTGCCTCAAAAAACAGCCCAGCCATTGGACTCTGGCCCCTGCAGCTGGC 
GGCCCGACTGCTTCCCCAAGGGGTCCCACCCTGTATACACCCGGGTGCTCTGG 
TCCCAGGTGAGTGGGAGCAGGGACATCTCATGGCGGCCCATGGGCCCTGCTA 

15 GTGGAGGGGGCAGCTCTAAGGAAGGGCCAGGCCCAGCTGGGGGAGGTCCTG 
GTCCAGTACGTAGGCACTGCTGCACCTGTGGGTACTGCAGAGGACGGGCTGA 
GAGTTTCGGGAAATGTTCTCGTGATGGCCTGGAGAGTCAGGGGGTCTGGCTG 
CCGTGTGGGGTGGGAGCATTGCCTGTGCCGGGTGGGGGGACAGTCAGGGGGT 
CTGGCTGCCCTGTGGGGTGGGAGCACTGCCTGTGCCGGGTTGCGGGGACAGT 

20 CAGGGGGTCGGGCTGCCCTGTGGGGTGGGAGCGCTGCCTGTGCCGGGTGGGG 
CCTCACCTGTGCTCTGAGCCGCCTTCACCAGCCCCTTGCGCAGCACATCCCGC 
AGCCAGGACTTCATGGCGAAAGGCTGCTCCTCACCCACCAGGGACACCACCA 
GGTTGGGGGCCGGCAGGTGCCACTCAGCAAGCAGCAGGTCAAAGAGCACAG 
ACGGGGCCACTCCGCTCGGCACCCQTACAAACTGCAAGGCAGGTCTGCAGCT 

25 CAGGGCTTTCGGGGCACGGGGCATGGGAGCTGCTGGGCATTGGGGGGGGCTG 
GACACAAGGGGGCACTGGGGAGGTGCTGAGCATGGGGGGACACTGGGCACT 
GGGAGGTGCTGGGCATCAAGGGGGCTGGACACAGGGGGTACCGGGGAGGTG 
CTGGGCATTGGGGGCAGGTAGCAGGAGGAGAGTGGCCCCCAATATCCCTTCt 
^ GGGACAGGGCCAGTCCTCCCTTGACCTGCTGTGTGACCTGGGCAGTCCTGACT: 

30 CCTCTCTGGGCTGGCATCCCCCTCAGTGAGCCGAGGGCGCTGGACAGAGGTG 
GGGCCCTGGATGCACTTGACCTTGCACCTCCAAATCTCTCGGGGGGACTTCAG 
GGGTCCCGGTTATTGCAGGGAGGTAGGAGGCCGGCCAGAGCCACTCCTTCCA 
GCTCTGGAACTCTGCCTGTCTGGGGGCTCTGAATATGGCCCCAGGGGAGCAG 
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ACCCCAGGGGTGAGGAGGACCCTGGCACTCAGCTTCCAGGGAAGGAAATGG 

GGCCTCATGGGTTCCAGGTCAAGGCGGACATCACCTGAGGCCTGGGGCCTGC 

CCTTACCTTGCCTCGCTTCTTCCCAGACCCTCCAAAGTTGACCTCGCCCCTGTG 

CAAGCCCAGCTCCCGCCGGTCTTCAGCATCCCCGGGGCTTCCGGGACGGGGG 

CCTTGGACATCCTGCATGGTGGCCTCTAGAGCTGCAGGGATACCCTGGCCCTT 

GAGGGCTGAGAAGGCCCATCCCCACCTCCCTCCCCTCAGGGGGCTGGCTGTG 

CTTTCCCCTGGGAAGCCCCTCTCCAGCCAACATGCACAAGGAGACCTGGGGA 

CTGCCCCCAGGCATCTCTGCCCCAGGGACCTTCTCTCCAGCCAGGCCCTGCCC 

AGCGACCCCCTGCAGCTCCGACACTCGCTGTAGCACCGCCCCAACCAAGCTC 

CTTCCCAGGTGAGGATGGGATGTGTGGGGCTCCGCTGTCAGCGCCTGGTTCCT 

CTACTCACCCCACTCAGGGCACCTCAGGTTGGCCACACCTGCAGACATTTCTC 

cagaggccttttgtggggatgacctggaaactccgaaagactcagcagaggc 

cttgcagtcccctggccacacagcccacccgctcccaggacagggatgtgga 

gaccctgggtgcacctgggtagaggtgaaaaggtgagacaggtgccctcagc 

tcaagtgcaggaggcggtgtggacgagagaccaccttccttgcccaggaggt 

ccaggaggcccaggacacacagagagaggaggccttgccccggcccacaca 

gcatcagcagcagagctattcccgtggcgctcccacatcagcaagcgggggt 

AGAAGAGGCCGCGTGCGCCTC AGCCCT ATTCCCTC AACCCTCATGGGGCTCAT 

GTCTCCCAGCTGCCCGGTGAGGTCGGGCATCTCAGCCCCTCTCTATGTGGCAG 

GGATCCAAGGATCGGCACTGGCCCCTCAGCGCCCTGGCTCCGCCCCTGCCCAC 

AGCCTCATCTGTCAGCTGCCCAGCCCCAGCCTGGCTGACCTCGGTCTGGCTTG 

GTCCCACCTCCCTCCGATTGCTCCTCTGAAAACTCAATATTGGAAGCTGGTGC 

TTGAGGACAAGCCCTGGCAGGCCGAGGGCACCTGCTCCTCTGTCTGCCCTCCT 

GGAACCACGCGTCGGGTCACAGGCCGCTGGGCAGAGGTCTCCGTGGGCCCCC 

AGCCACCCTACCCTGTCCGACCTGGGGCTACCTGGCAGCTCCTGATCCCACAG 

GGCTGTCCCTTGGTTCTCCTGTGCCCACCTTGTACCCAGGCGCTCTGGGCTGT 

GTCCACCACAGAAGCCCTTTGTGAGAAGACAATGGACCGTTCTGTAAAAACC 

TTCATTTCCTAGAGAGGGGCCCACCTGTGACCCCAGGAGGGGATACAGGAGT 

TGGGGGGAACTTVAGAGATGAGAGGGAGTGTGTCCTGGGACCCCTGGGGCTG 

GCnTGGGGCAGAGGCTAAGGCCAAGGCCCTCCCCATTAGGCCTCTGGAGCC 

AGCACAGTTACTGCATCCCCAAACAGGTATGTCCAAGTCCCAAACCCCAGGA 

CCTGGGAATGTGACCTTTGGAAACAGGCCCTTTGCAGATGTATTAAGATGTAA 

GTAAAGATGAGGCCATGCTGGAGCAGGGGGAGCCCTGATTCCCAGGTTCTGA 
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TAGAGAAAGGGGACATTTGGACGTAGAGACAGACGCGCAGGGAAAAACACC 
CTTTAAGACACAGGGGAGGGGTGAAGGCAGTGGTTGGGTGATGTGGCCACAA 
GCCAAGGGACACCTGGAGCCCCGAAGCTGGAAGAGGCAGGAAGGATCCCCC 
TCGAGGGCTTGGCCTTGCCGACAGTTTGACTTAGGACTCAGCACTGGTGGCCT 

5 CCAGAACCATGAGAGAATCAATTGCCATTGTTTTAAGATGCCCAGCTCAGGCT 
GGTTGTCTTGGCTGCCCAGGGCCCCCCTGTGAGGCCCAAGTCAACCCCGGCAC 
GACTTCTGGTGACTGCCAGCCCCAGGCAGGACCCACCCACGGGCCTCAGCCC 
CCTCCTCCGGCCAGGCCTGTGGAGGCGCCCTGATGCCCTCAGCACCACG TGG 
GCCTCAGCTGTCCCTGCAGGGTGATGCAAATCCTCCCGCACTTCTGCCTCCAG 

10 AGCAGGAACCAGCCCGCGGACCGCAGAGAGGCAGGATTTGAATACGGCCTCC 
CGCGCTGGTGGAGACGCCTGGTGGTTTTCTGTGTTGGGAGCTGGTGTCTGTGG- 
CCAGGCCCCAGGTGGGGGCTGGAGGCTGGGACAAGGCTTGGCATTGCTGTTG 
TGTCCCGGGGCTGCCAAAACAAAGGGCCACACTCTCGGGGGCTCAAAACAAC 
AAATGGATGATCTCATAGTTCTAGAGCCCAGACGTCCCAAATCCAGGCATCG 

15 GCAGGGCCGTGCTCCCCCTGGAAGCTCTCAGAGCGGGTCCTTCCCACCCTTTC 
CAGGTTCCTGAGGCGCCAGGTGTTCCTGGCTGTGGCCGCGTGGCTCCAGCTTC 
TGCCTTGGTTGCCGCATGGTCTGCCTCGCCTGTCTGTGTCCTCTGCAATCATTG 
ATTGCCCAGGCCCGCCCTACCCCAGCGTGGCTTCATCTTCATGGTGCGCCATC 
TGCAAGGACCCCACTTCCAAACGAGGTCATATTCTAAGGTTCCCAGTGGACA 

20 GTATCCAACCATTAGTGTCGCCCCATCTGGGAGGACACCACTTAATGCAGAA 
CAGGCAGGGAGGTGGGATGTGTGCTTGTAGCCCTGGGCGGGAGTCCTCACCC 
TGGGCGGGGGTCCTCAGGCTGGCCTTCACTCCCACCAGGCCCTGCCACACCTG 
GTCACCCGCTGGGGTCAATCCTGGCTACGTGTGGACATGTGCCCTGCAGCTGG 
GGATGCATGGCCAGGCGTGTCCCCAGCACAGGGTGTAGGGGGCTTGGGGTTT 

25 GGCCATGACCCTCAGCCTGGGGGACGGGACTGGGCTGGACAACACTGTTATC 
CTCGCCTTCCCCATGACTCTTAGGCCATAGATGGGGGACTCCTTCTTCTCTTCC 
CTTCGTCCCTGGTGGGGAAGCTAATGGAGGGCCTGGGCCTTGGGAGGGGTGC 
TACAGGGCTCCCTGGCCCTTGAATTGCTTTCCAGTGAGCCTCAGCTTTGCCTCT 
TGGAAGTTCTGGGATAGGACATCCGGG;\GTACCCTTCAGCCTGGAGCTCGGC 

30 AGTGCCTCCCTGACTCTATGCAAGACCCCCGGGACGACCAGCAGCCATCAGA 
GGTGGAAGCAGCCACAACCAAAGCTCTGGCCCTTCCCAGGGCCCACAGCACC 

tccttcctcccgaggacgg'gacagagctgggcacccagcagctgaggtcagg 

GAAGCCGAGTAGCAGCGTGGGGCAGCCTCACCTTCCTGGTGGCCGCAACCAC 
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CCCTTGCCCCATGCCCAGGCTCGGTGCCCCCCGCCCCCGCCGGCCAGCCCAGG 
GACTAAGGAGGCATTTGGCCAGGTGCTAGGCACCAGGCAAATGGGCCCAGAG 
GGAGGTGATGCAAACAGCTGGGATTTGCATGAGCAGGTGCCCAGGTCTTCTG 
CACCCCTCCCTGTGCATCCCAGGGTGGGGCAGGGAGCCCGGGACAGCCAGGG 
5 GCCATCGTGACCCCTGGCCAGAAGTCCACCCCAGAACTCCCAAGAGGCAAAA 
CTGAGGCAGCCCCCCAGACGGTGAAGGCGCTGGGGCTCCCAGGGCAGCGGG 
GCACCTGTAGCCGCAGCCAAGAGCCAAATGTGGCTTCCTGGGCTCCCCAAGA 
CCAGCTCAGGGCAGAGCTGGATGACCAGGCCCTGGGGTCCCAGCCCCGCTCA 
TCCCCACCAGGCTGTGTAACTTTGGGCAGGTCCCTCAGCCTCCCTGGGCCTGC 

10 CTCATTTGTGAGATTTGGGGGAACGGCCAGTCACAAGGGGAGGGTGCCAGAA 
AGGTCCCTGCGTGCAGCAGGCTGGAAGTCAGCTGTGTGTCCTTGTGTCCTCCC 
TATGAGGATGCCCCATCTCCCTGGGCTGCAGTGGAGGACACGGCTCAGGGAA 
GTGGTGGCAAGAGGACACAGTAGTGACCTGGTGACCCCAGACCCCTCCCACT 
GCCCCACTCCCTTCGCCCTCAGCTTCTGAGCCGCACCTGCCTTTGCAGGCTGC 

1 5 TGGCC AG AC ACTGCCCCTGAAGTCTC AGATCC AGAGCCCGTGGGGCTGGCCC 
TCCTGCCCAGACCCTCTTCCAGCACTGGGCAGCCCCTTCCAGAATGCCCCACC 
TCACCCCAGTGGGGCCGGCCCTTCACAGCCCAGGGAAGCGGGGATCCTCCCT 
GCTAAAGATAAGGAAACTGAGGCAAGAGGCAGGGGTGTGGCCAAAGCCGCA 
CTGTCCTGGTCCCCACTCCTCTCATGCCCACCTGCCCTTCCCACCTGGGAGGA 

20 GGAGGTACAATCCTGCTCCTTCTCCCCACCTGGGCTGTCAGCGGTGTGAGGGT 
CCCTACCTGTCTGCCTTCTCCCCAGCACCCTGACACGCGGCTGCTGGCCAAGG 
CGCGGGTGGAGCCGTCTGCCTACCTTGCAGGGTGTGCCTGTGGGCCCAACTCT 
CAAGCCTCTCCTCCTCCTCTTCCCCGCCCACAACTCCTGGCACCTGGGCAGCC 
AATGCCAGCCCTAGGGAGCTGGCCATGAACCAACACAGGGCACCACCTCACC 

25. TGCTCCACACGGCCGAGCCCACAGGCCTGCTGTCTCCAAAGCCACCCAGACA 
CCAAGGGCTGGCCGTCCTCAGGTGTGATTGGCACCAAGGCCCTGGGGGCATG 
TCCGATTTGACTGGGAGAGGAGACCCTGTGTGGACGTGGCTTCTAGGCCACTT. 
AGATTGCTGTCGTTGCCTCCTTTGGCGATGTCTGGCGGGGTGGCCCGGGACCG 
ATGCAGCTGAAATCCACCCTGAAATGTGATGGGACAGGGCT^TTCACCTCAG 

30 TTTCCACAGTAAGTAGGAACAGGAGCtGCCTCAAGAGCACACGTCTCATGCC 
GGGGTGCTGGGCCCAGCTGTCCTGTGGAGGACAGGGAGGCAGAGGGACTGTC 
TCCTCCCTGCTGTGGCCCTGCGAGGCCGTGGGGCATCGGCCCCCATGGTGTCT 
CATCTGTGCTGGAAAGGCTGCACAGCCGTGGGCCAGCTGTGAGTGATGTCCA 
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TTCACCGGTCAGCACCAGGGATGCTCTGTGAGGACTGTCCCTGCATCCGGAGT 
CAGGACAGGGTGCCAGTGGGAGAGAGTGTGCCTCTGGAATCCCGCAGTCCTG 
GGATTGGATCCTGCATAGCTGTTGAGCTACTTAGCCACTTGGGCCTCAGTACC 
TTGGAGTGTCAAACAGGGACTCTAATAGCCTGGCTTCTGGGAGGTTGCAGGT 

5 AACACATGAGGAGAACAGAGCCTGGGGGATACCTCTGTGTGCCCCTTCCCAA 
AGAGACTCCAAGGACCCCCTGTTCGGGGCCCCTCAAACCATAAACTCCAGGA 
ATGGTGGGGGACAGTATCTGGGACTTGCTCTCTGGGACCCTGGTGGCCTCTCT 
AGTCTGGCCTGTCAGCCACCAGAGGGCGCTAAAGGACCATGGGCAGAGTCTG 
AGGCAGTGGTGGTAACGGGTGAAGGTGATGGTGGTGGTGGTGATGATGGTGG 
1 0 C AGT AGTGGTGGTGGTGGTGATGATGATATTGATGGTC ATGATGGTGGTGGT 
GACAATGATCATGGGGACCGTGCTGATGGCGGTGTTTGTGATGATGGTGGCG 
GTGGTGGTGATGGTGGTGGGCATGAGGGTAGTTGGGGTGGTGATGGCATTAC 
CGTTGATGAGGGTGGTGATGATAGTGATGGTGATGGTGGTGGGAATGATGAC 
GAAGATGATGATGATAGTGGTGCAAATGGCAGTGGTGGTGGTAGTGACTGTG 

1 5 GTGTTGTTG ATGGTATTAGTGCTGATAAAGGTGGTGATGATGATGATGGTGGT 
GGGTGACGAAGGTGGTGGTGGTGATGATTGTGGTGGGGATAATGGTGGTGAT 
GGTGGTTATTGCAGTGGTAGTGTTGATCATGGTGGTGGTAGTGATGGTGGTCA 
TGATGTTGGTGGTGTTGGTGTTGGTATTGGTGTTGGTGATGGTGATGGTGGTG. 
GTAATAATCATGGGATGATGATAGTAGTCATGGTGGTAGTGACAATGGTGAC 

20 AGTGGAGGGGATGGTGGTGGTGATAATGATTGTGGTGGTGATGGTGATAGTG 
TTGGTACTGGTGGTGGTGATGATGGTTGTGGTGGTGGTGATGATAGTGGAGGT 
GGTAGTAATGATGATGGTGGTGGTAGCTGGTAGTGGTGGGAGTGGTGGTGAT 
GGTGATGATTGTGGTGATGGTGATGGGGTAGTGGTGGTTATGGTGGTGTTGGT 
TGTTATGGTGGTAGTGATGATCATGGTGGTGGTGACTATAGTCACTGGTGATG 

25 CTGGTGTTCATGGTGATGGTGATGATGATAGTGGTGGTGGTTGTGGCAGTGCT 
GCTGCTGATGGTGGTGGTGATGATGAGGGTGATGATGACAATGAGTATGATG 
ATGGTGGTGGTGATGATCGTGGTTGTGGTGTGTTGATGGTGGTGTTTATTGTG 
GCGGTGATGATGATAGTGTTAATAATTGTGGTGATGTTGGTGATGGTGGTGAT 
GGV GACGATGATGGTGATGATTGTACTGGTGGTGTTGGTGATGGTGTGGTAC 

30 GATGGTGGTGCTGATGGTCATGATGGTGGTGGGGGTGATAATGATCCTGGTQ 
GTGGCCATGGTGGTGTTTTTGATAACGTGATAATGTTGGAGATGGTGGTGATT 
GTGGTGGTGATAATGGTGGTCATGGTGATGGTGATATCAGTGTGGGTGATGT 
AGTGGTGGGGATGGTTCAGATGTTGGTGATGGTGGTGGTGATGGCAGTGATA 
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GTTAGGATGATGGTGATGATGATGATGGCATGGGTGCTTGATATGGTTTGGCT 

CTGTGTTCCCTTCCAAGTTGTAATCACCATAATCGCCATGTGTCAAAGGCGGG 

ACCAGGTGGAGGTAATCGGATCATGGGGGTGGTTTCCTCCATGCTGTTCTCAT 

GATAGTGAGTGAGTTCTCATGAGATCTGATGGTTTTATAAGCATCTGGCATTT 

CCCCTCACTCCATCCTGCTGCCCTGTGAAGAAGGTGCCTCTAAGTCTATGGAG 

TTGGGGGGCACAGTCCTTCATCTATCCTGGTCCTTCCCTCCACCTCTGTCTCCT 

GGACACCCAAAAAAACTGCAGAAGTTCTTCCCCAGCCAACTCTGGTTCAGTG 

CCAGGTCATGTTCCTCCCCGCCTACAACCTTCCATGGCTCCCTATTACCCTGAC 

AATAAAACCCAAAGAAGCCTTGGAGCCTTGGCTTCTGGTGATCTGCTAGGAT 

CACTTTTCCTTGGAACTCCTGCTGGCTCCCTCTTGCCACTCAGATACTGGCAG 

AACCTGGAGTGGGTGCTGGCTTGCACTCACTCCATCCTGCTGCCCTGTGAAGA 

AGGTGACTTGCTTCCCCTTCATCTCCGTCATGATTGTAAGTTTGCTGAGGCCA 

CCCCAGCCATACTGAACTGTGAGTCAACTCAACCTCTTTCCTCGGTAAATTAC 

TCAGTCTCAGGCAGTTCTAATACAGTGGTGATACTAGTGGTGGTGGTGGTGGT 

GGTGGTGGCTGGTTGTGGTAATTACAGTGGTAGTGGTGATGGTGGTAGTAAT 

GATGATGATAGTGCTGGTGGTGATGTTGGTGGCAGCGGTGAGGGTGGTGGCG 

GTGATGGTGACGGTGTTGATGGTGGTGGTGGTTTTGGTGGTGGTGATGTTGGT 

GGCAGTGATGAGGGTGGTGGCAGTGATGGTGACAGTGTTGATGGTGGTGGTG 

GTTTTGGTGGTGGTGATGGAGGCGGTGTTGGTGTTGGTGGTGGTTTTGGTGGC 

GGTGATGGTGATGGTGTTGGTGGTGGTGGTTTTGGTGGTGGTGATGGTGGTGT 

TGATGGTGGTGGTGGTTTTGGTGGCGGTGATGGTGATGGTGTTGATGGTGGTG 

GTGGTTTTGGTGGTGGTGATGGAGGTGGTGTTGGTGGTGGTGGTTTTGGTGAT 

GGTGACGGTATTGGTGGTGGTGGTTTTGGTGGTGGTGATGGAGGTGGTAGTG 

ATGATGGTGGTCATGTTTATGGTAGTGATGACAGGGATGATGCTGGTCGTGGT 

AATAGTGGTTATGGTAGCAGGCCATGTTATAACGCTAGCAGCTTTGCTCTGGA 

TGGGAAGGCTCCAGGTACTCCCAAAGTAGGCACCTATGGGCTGCTCTGTAAA 

TGTCCCTTTGTCCCCTAGCATGGCTGAAACCCTCATCCTGAGAGGAGAAGCAG 

ATATCAGCTTCCTCAGTAAATGCAGGTTGGTGTTGAGCTGAGAAACCCAGAA 

CTTGGCTCTGAGC AAAT'i GGACTC AGACC AGGG ACCCC AC AAGCCCTC AGGC 

CTGAGACCCTAAAATCGTAGTTCTGAGAGCGAGGGGCCTATCTTGGGGGCCC 

TGGGCAAGGCGGAGGGTCCACTAGACCCTGAAAGTCCAGCCTAAGCTCACAT 

CCACTCCTCTAGCCTGTGTCGTTGCCCTACAGCAGGTTCTGCCAGTGTCTGTG 

GTATGTGAATAATGAGGAGGTTCCCCCAGATCCTCTAAGCCTATGGGGTTGG 
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GAGGCACAGTCCTTCCTCTATCCTGGTCCTGCCCTCCACCTCCATCTCCTGGAC 
ACCCAAAAGACTACAGAAGTTCTTCCCCAGCCACCCCTGGTTCAGTACCAGGT 
CATGTTCCTCCCTGCCTACAACCTTCCATGGCTCCCTATTACCCTGACAATAA 
AACCCAAAGAAGCCTTGGAACCITGGCTTCTGGfGCTCTACCAGGAGCACTCT 
TCCTTGGAGCTCCTGCTGGCCCCCTCTTCCCACTCAGATCTCAGCTCTCATGTC 
CTTCCTCAGAGACCCCCTCACCCTAGACCACATCAGCACAGGGTGCCTCCCTC 
GGTCTCTTCCCCTGTGATCTGGTGTTAGTTTTCAGTATGCCTCACCCTCTGAAA 
GGTCTTTCATTTACTTTTTTGCTTGTCTGTGTCCCCTGCATGAGACTATGGCCT 
GAGCATTCCCTGCCACCTCCCTGGGGCCCAGAACACGCAGGTACAGACCACA 
TGTTCTAAAGTCAATGAATGAATGAGTGAATCAATGGGTGAGTGTCCAGGCG 
TGTCTCTGTGTGAGCACTGCCATCAAGGCAGAGACGGAGGTGTCTAGGGGAG 
GCTGCATTTTTGCAAGGAGCAACTTCCTGGCCCACACTCACGCCCTGGGTGGA 
AGGGCAGACACTGGGCCGGGGCTCCACTGTCTGCCCGCTCTAGCCATGGGCA 
CTGCTGCCAGGGGAGGAGGTGCCCAGGCGCTCACCTGCGTCCATGCGTGCTG 
TCATGAGGGACATCCCAGCTGCCGACAGAGCAGCCTCTGCCTGCCCTCTACA 
GCGGGTTTCCTCCAGCGGCCTCTGTCTTCTTTGCTACAGTTGGCCCCGGGCAG 
GCTTTACGTTGGAACTTCCTTTGGAGGAGAGGCCACGCCAGTAGAGCAGGAC 
TCCAACAGGGCCCAGCCCCTGCCCCCACGGGGCCTCTATCCCTGGGACAGCA 
GATCCCCTTTGGATCTCCAGCAACCCCAATGGCTGACTGCAGAGGCTCAGCTC 
TCTCCTGCCAAGCCCTGCCATGCGAGGCTGGGCCCACAGGCTGAGAAGCTGA 
CCCCAGTGTCCAGGAGAGCGTGAGCCGTCCCTGACCGGGGTCCTCGGGCCTC 
TGCTGAGAGAGAAGGAGCTGCAGGCTGGCTCCCCACCTGCACCACAGGCAGC 
CTCCCCCCAGGCACATCCTTGGGTGCCATGTGGCATTTTATTTACAGAAAGAT 
TTGCAACCCACTTCTTCTTTGTCTTAGAGGGGAGAAACCACCACCCAGAGTAG 
GGCCCGCTGGCCCAGGGTCCCCATGCCAGTGTGGTGGGCGCCCCCAACCCAG 
CATCTCCCTCTGAACCACACACAGTCTAGCCCCAGGTGGGAGGGAAGCAGCC 
CGGAGCCCCTCAGACAGGAAAGCTGATGGTGGGGGCGTCAGGTCTGACCCCT 
CGGGGGGCAGCCCGAAGCCATGCAGTGGGGCAGCAGGGCCATGGGCAAGCT 
CTGCCAGTGGTGCCTGTGTCCAAGGCCTCCGGr-CAGATCCTGAGTGCCCAGC 
30 AGTGGAGACCAGGAGTCCGGGAACCAGTGGAGCCGCACGTCCGTCCCCACTC 
CATTAAGTTCCCTGGCCCGGGCAGCGCCTGCAGCGTGCCACTGCTGTCTCGGC 
ACCATCCTTGCAGCATCACCTTTTGCCTGTCCTTCTACTCTGGGAGGGCAACC 
TTGACCCCATCACAAGATGTGAACAGAGGTAGAGGCCGTAGGGGCCAGCGAG 
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GCCAAATGACCTGCCCATAGCAATGCCGCCATGAGGCAAGGCCCACGCCCTC 

TCCCCTGCTGTCCTGTCCTGGAGGTTCCACAGCTGCAGGAAGACGTCCCCCTA 

CCCTGCTCCCCGCAGAGGGTGCTTGGCGATCAGACAGCAGATCTGCAAAGGA 

AGGTGTAGCCTTCTTAGTGTGCGCAAGGGCtGCCCGG'CCAACTCCACCATTGA 

GGGCAGATGCTGGCTGTGGAGAGGCCGAGCGCTGCCCGGGCCCAGTGGCTGG 

GGTAGGGGGCTCCATGCACTGTCCATGAGGGCATGTCCATCTGGCATATGTGC 

ACCTGTGTGAGTGTGCTGTGTGTGGAGCGGGCACTGGGCCAGTGGTCAAGGC 

TGCAGTAGGACTTTCAGTGATTCCAGATCCTGTCTGGAAATGGGGATGATGAG 

AATTCCTTCCACTGAAGGGTGCGTAAGGCCTGAGCCAGTGTGCCTGGAACGT 

CCTGAGGACCAGCGGGGCCGGGGCCAAGACCTGGACAGGGCCTGGGTAATTG 

CGAGAAGCAAGGACCAGCGTGTGCCTGTGTGTGTCACTGTGCGCACATGGTG 

TGTGTGTGTGCATGCATATGTGTGGACACCGTGGGCATTTCTGTGTGGATGTT 

GTGTGTGTGTAGCTGCACTGTGTGTGCGGACAGCAGGTGTGCCTGTGGGTGTG 

TCTGTGTGACACTGTATATGAGTGTATGTAGACATTGTGTGTGGCTGTGTGTG 

GACACAGTGTGTGAGTGTATATGTGTGTGTAGGTGGACTGTATGTGTCTGTGT 

GGATCTGTGTGGATGCTGTGTGTGTGTATGTAGACACTGTGTGTGGCTGTGTG 

TGTGGACGCAGTGTGCGCGTGCATGTTTGTGTAGGTGCACTGTGTGTGTCTGT 

GTGGATGCTGTGTGTGAGTGTATGTAGACACTGTGTGTGGCTGTGTGTGTGGA 

GGCTGTGTGTGTAGGTGTACCATGTGTCTGTGTGGACGGTGTATGTGTGTGTA 

TGTGCACCATGTGTGTCTGTGTGGACGCTGTGAGTGCATGTGTGTGCGTGGAC 

ACTGTGTGTGCGTCTGTGTGGACACTGTGTGACTGCATGTGTGTGGCTGCACT 

GTGTGTGTGTGG AC ACTGTGTGTGC AAATGCTGTGTGTGTGTGAATGTGTGTA 

GGTACACCGTGTGTGTCTGTGTGGATGCTGTTTGTGAGAGTGTGTGTGTGTGT 

AGGTGCACTGTGTGCGGACACTGTGTGTCCGGCTGTGTGTGTGTGTGTGGACA 

CTGTGTGAGTGCATGTGTGTATAGCCGCACTGTGTGTGTGTGTGTGGATGCTG 

TGTCTGTGAGAATGCTGTGTATGAGTGTGTGTGAGTGTATGTGTAGGTACGCT 

GTGTCTGTGTGGACGCTGTATGTGAAAGTGTATGTGTGTGTGTGTAGGTGCAC 

TGTGTGTGTGTGGACACTGCATGTGTGTCTGTGTGGAAACTGTGAGTGCATGT 

ATGTGTAGCTGCACTGTGTGTGTGTCTGTGTGTCTGTGTGGACACTGl'GTGTG 

TGTGTCTGTGGTGTGCAGGTGCTGGAAGGCAGGCTCTGCAGGTGCAGCTGGA 

GCTTCAAAGTGTGAGCATTGGGCCAGGAGGACCACAGCCTGGCTGATCCAGG 

GGAGAGAAGTCGTTTCTGAGGTTCTGGAAATGACAGCCATTGGGAAGCACCA 

CACCGCCACTCATCCATGAACTCAACCATCCGATCAGCCATCAGTTCATGCAG 
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TCATTGGATACAGATTTTTGCTATTCATTTATCTACTTACTTTTGTAGAGATGT 
GGTCTCACTATGCTGCCCAGACTGGCCTCAAACACTTGGACTTGAGCAATCCT 
CCCACCTCGGCCTCCCACAGCACTGGATTACAGGCATGAGCCACCACAGCCG 
GCCTGGGGAGAGATTTATTGAGCGACTTCTATGTGCCAGGCACTGTGTCAGG 
5 GGGAGACACAGACAACAAGCAAGGGGATACGTGAGAAAGAAGGAAGATCAA 
TCGATGATTAAGATGGCTATGGAGAAAAATAAAGCAGGGGAGGCCCAGCGG 
CTGCAAGATTTTATTTTTAAATGGGCGGGTCGAGGAAGAATTCATGGAGAGG 
CTGCCATCTGTGCAAATAACTGGACGAAGGGCGGCCCATCTGCTGGTCTATCC 
ACCCACTCACTCACCCACCCACTCATGCACACATCCACCAGCACATATGCACC 
10 TCCTCACCCATCACCACCCACCCAGCCATCCAGCCTTCCACCAGCCTGCCCAC 
CTGTCATTCACCCATGCACCCATGGCAATCCATCCCTCCATCCGTCCATCTGTC 
CATTCGTCTGTCCATCCATCCACCCGTCTGTCTGTCGATTCATCCATCTATCCG 
TCCATCTGTCTGTCCATCCATCCATCCATCCATCCATCCATCCATCCATCCACC 
CATCCACCAATCTATCCATGCATCCATCTGTCTGTTCATCCATCTGTCCATGTG 
15 CATCTGTCCATCTGTCCATCTATCCATCCATCCATCCACCAATCTGTGTATCCA 
TGTATCAAACTGTCCATCTGTCCACCCACCCACCCATCCATCCATCTGTCCATC 
TCTCTACCCGTCTGTCCATCACTTCACCCACCTGTTTTCTCCTCTCTCCAGTTA 
CCCACTCACCACTTACTAACTTCTCCACAGTGCCTAGTCTCTCCAGGATGGCG 
AGTCAGTCTCTGGTCTCAGGATGCCCACACTCTGGGGAACAGGCAGCAGGTA 
20 GCCATGGGGGATTGGCCAGAGCACCCCAAGAGCGTAGGCAGTTCTCGTGGTT 
TCTCCCCAGCCAGCGTATCTCCCAGAAGAGGTCACTATCCTCGAGGCCAGGA 
GCCTCACCTTCCGCTCCCCTGACCTCGCCCACACCGTAACCCAGCAAGGCTGG 
GAATTCAGTAGGTGCCCAATAAATCCCGGCAGGCTGGCTGAGTCATCCCACC 
TCCTCCTCCCCTACTGATCACCCATTCAGCTCCTGGCCCCATTGCCCAAACCCT 
25 GCCAGCCTTTGGGTGATCTGCTCAGCCGCCACCTCACATCCACTCTCAGCTCG 
GGGCCAGCTCTGGGCTCCAGCCCGTGATGGGCCATGGTGGGCTTCAGAGGGT 
GAGCCCCCCACACCATTGTACCAGTCTGGCAAGACGGATGAGTGCCCTGGTC 
AGTAGTGGCCCACCTTGGCCTGGAAGGATGTAGGGACAGGCTGCCCACCCCC 
ATTTGCCC 1 CCTTCTCTGAGTGGCTAGGAC AGCC ACC ACCACTCTCC AGGG AG 
30 GGGAGGGGCTGGCCTGTCACACGTCCCGGACAGGCCTGCCTGG'GCCCAGGAA 
AGGGCTATGTGTGTAGGTGGCAGGCTGCTGCCCTGGGCAGGGGCCCCGCCAG 
GGCTGGGGAGGCCTCAGACCTTTAGATTGTGGGGCTGGTCCCCAGCACTCCC 
ACTGGGGGTGGGAAGCTTGGCGGCCAGTGACGGGAGGGTCAGCAGAGGGGC 
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CCGTGCTCCTGGCTGAAGCCAA.CCTCCCACACAAGGTGATGCCGACAACTCC 

AAGAATCGATGCTGTGTCAGGAGCAGTCTCCGGAAGTGATTTCCGCCAGATG 

GGCTGGGCCCAGGTGGGAATCTGGGAGCTTCCCCAGAACAGGGAGCATGAGA 

CCAGCTCAGAGAGTCTGCAGTGACATCAGATGACCTAACCTCAGATCAAAGT 

GAACGCAGCCAGAAGCACCCCGCTCGCCCACCTGGGCACTGGGCCGAGACCC 

TGGGTCCTGGGCTGGGAGGAGTGAGTGTGGGTGTGTGACCTTCAGGGAGGGC 

CCCGCCACCAGCCATGAGCCCACAGGGTGTAGCTCCCCACCTCTCTACTCTGC 

CCCACTGCTTTCTGTCCTGGGTGCACCTTTCCCAGCCTGCTGGATCCTGCTGCT 

TTAAGGCTGCCTGTCTTAGGAGAAGGGATGGGGGATCCCCAGGTCCTTCATG 

GTGACAAAAAGGCTCTTTCACACATGCACACCTACACATACTTGCACACATGT 

GGGCAGACACACATACAGCCTCAGTCTTCCCTGGCTGGTTCCCAGCTTGTCGT 

CTTGGAAGAGGAGGAGGTGGGGTACACCCAAGCTGTGAATTGGGAGACAGG 

TCATGACACACCCATCTCCTACATCTCCTGATTTCACAGGTGAGAAAATTGCC 

TTGGTGGGGAGCCCTGGCTTCAGTGTGCATGTGGACCTGCTTCTGGGTGGGGG 

GTGCCTTTCTCTATGAAGGTGTGTCTGTCCTGCACCACTGTGGGATTCAGGAA 

GGCAAGGCCTCTTGTGCAGTGTGTGGGGCACAGTCAATGCCGGTGCTGACTG 

TGGAGTGACCATCTGACATAGCCTGCAATCAGGAAGGACCATGTCAAGGCAG 

AACTGAAGGACCCTGAGCTAGTTCTGGCTCTGTCCTGCTGTGGTGCCCAAACT 

TGGGGACCCTCAGGACATTCCAAACCTCAGGGTCTCCTGCACAGGAGAGAGG 

GGAGTACACTAAAATGCAGATTCCTGGGGCCGACCCAGCCCCACTGTACTGG 

GTCTGCATTAGAGGCTGTTTACAGAGCCCCCAGGCAACTGCCCTGGGGTCCCC 

TCCCAGACCTGCTGGTAGGGGTGGGTCTGAGCTGACCGGTCTTTATGAGCATG 

CGGGGGCAGGTAGTTAGGGAATAGGTGAGGCAGTGATGAACCAGGAACCGC 

CTGTGGGGTTAGGATCAAGAACAGGGGCTTGCTGATGAACGTAAATATAAAA 

ATCCTCAACAAATACTAGCAAACCAAATCCAGAAACACATCAAAAAATTAAT 

TCACCACAGTCAAATAGACTTCATTCCTGGGATGCAAGGTTGGTTCAACATAT 

GCAAATCAATAAATGTGGTTTGTCACATAAACAGAACAAAAAAACCATATGG- 

TCCTCTCAATAGATTCAGAAAAAGTCTTTGAAAAAATCCAAACATCCCTTCAT 

GATAAAAACCTGCAACAAACTAGL'CATCAAAGGAACATACCTCAAAATAATA 

AGAGCTGTCTATGACAAACCCACAGTCAACATCATATTGAACGGGCAAAAGC 

TGGCAAAAGCATTCCCCTTAAGAATAGGAATAAGACAAGGATGTACACTCTC 

ACCACACAAAAAAATTACCTAGGAATACATCTAACCCAGGAGATGACGCATC 

TCCACAAGGAGAAACACTCCTGAAAGAAATTGGCTGGATGCAGTGGCTCACA 



WO 02/054069 



184 



PCTAJS01/49808 



CCAGTAATCCTAGCATTTTGGGAGGCTGAAGCGAGCGGATCACTTGAATCCA 

ggagttcgagaccagcctggccaatatggtgaaaaccccaactctacaaaaa 

aatacaaaaattagccaggcatggtggcatatgcctgtagtcctagctactg 

gggcggttaaggtgggaggatcacctaagcctgggaggttgaggctgcagtg 

5 aaccaggactgtgtcactgcactccagcctgggtgacagagtgagaccctgt 
cttaaaaaaaaattacagatgacacaaatggaaacacattccatgctcatgg 
ataggaagaatcagcatcattaagatggtcacattgtccaaagcaatctaca 
gattaagcgctattcctattaaactaccaacgtcatttttcacagaattagaa 
aaactactctaaaattcatatggaaccaaaaaagagccagaatagccaaggc 

10 aatcctaagcaaaaagaacaaagccagaggcatcacattacctgacttcaaa 
ctaatgttcaaggcgactataaccaaaccagcatagtactggtacaaaaata 
gacacatagaccaatggaacagaatagagagcccagaagtaaagccacaca 
cctacagtaatctgatcttcaacaaagtcaacaaaaataggaaatggggaaa 
agactccatattcaataaatggtgctgagagagctggcttgccatatgcaga 

15 agaatgaaactggacccctaattttcaccatatgcacaagtgacacaagatg 
gattaaagatttaaatgcgagacctcaaactataagactactaagagaaaac 
atcggaaataccattctggacatcagcgctgggaaagaatttatgattaattt 
ctcaaaagccattgccacaaaaatatacattgacaagtgggacctaattaag 
ctaaagagcttctgcacagcaaaagaaactatcaacagagtaaacagacaac 

20 ctacagaatgggagaaaatatttgcatctatgcatgtgacaaaggtctaata 
tccagaatctataaggaactcaaacaattcaacaagaaaaaaataaataacc 
ccgtaaaaattgctcaaaggagatgaacagatgcttctcaaaggaagacatc 
catgcggtcaacaagcatataaaagaaagctcaacatcactaatcattagag 
aaatgcaaatcaaaaccatagtgagataccatctcacaccagtcagaatggc 

25 ctttgttaaaatatcaaaaaacaatacatgctggcaagagtgagcagaacag 

GGAACGCTTATACACTGTGGGTGGGAATGTAAATTAGTCCAGCCACTGTGGA 
AAGCAGTTTGGAGATTTCCCAAAGAACTCAAAACAGAAGTTCCATTTGACCC 
AGCAATTCCATTAATTCCTGGATACATATACAAAAGAAAAATCATTCTACCAA 
AAAGACACCTGCACTTCCATGTTCATCGCAGCACTATTC;\CAATAACAAAGA 
30 CCTGGAATTAACCCAAGGTGCCCAACAGTGGTAGACTGGGTAAAGAAAATGT 
GGTATATACACACCATGGAATACTATGCAGCCATGAAAAAGAATAAAACCAT 
GTCATCTGCAGCAACATGGGTCCAGCTGGACACCATTATCCTAAGCAAATTA 
" ACTGAGGAACGGAAAACCAAATACCACATGTTCTCACTTATAAGTGGGAAAG 
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TGGGAACTAAATGATGGGTGCTCATAGACATAGAGGGACAACAATAGACACT 

GGGGACTGGTAGATGGTGGGGGAGCAAGCGTTGAAAAACTATTGGCTACCAT 

TGTCACTATCTGGGTGATGGGATCATCCATACCCCAAACCTCAGCATCACGCA 

ATATACCTATGTAACAAACCTGCACACATACCCTCTGAATCTAAAAAAGAAC 

AGCAGGGCCTTAAATGGGGTGATGCCTTACAGATGTAGGGCATCATATCCAT 

AGGAGACATTCGCCAGGCATAAGGATCCTCTGGAGAGACACATCAGCTTGAG 

GAATACAGGGTGGTGGCTCTGTCCAGCCACCCTGCCCCATCTCCTACCCTCCC 

ATCCATTCTGCATTGGGTGCCTCCTGCATGCAGAGTCCTCCTCCAACAGCAAA 

TGCCTCCCAGATTTGCTGAAGGGCCTGCTCCAGGGTCTCTGTGGAGACGCCCC 

TGGGACCCTTGCTGACCACCCTCCACAATCTGCACCCCCATCCTGTGCCCCCT 

TCGTTTTTTTTGTCCCACCACTACCTGGTGGTGAGACAGCTCTGTAACCTGCGC 

CTAGAACAGCGCCTGGCACTCAGCCGGTGCTCTGCGAGGATCTGGGGGAATG 

AACGAATGATTGAATGAATGAATAGGTAGGAAGACTACCTGAAGGAGGATGT 

TGGCCAGGAAGCCTGTGGTGGCCTGGCTGTTCTAGCAGCCCAGCCCTGGCCC 

GCAGATGCTGATCTCACAGCAGCCACACACCCTTCCCTTGGTCAGCCTCAGCT 

TCCCCATTCCTCATGCATTCTTGGTGACTTCTGAGGCTCCTCCCACTGTGGGCC 

TCTGACCCTGCTGGAGGAACATGAGGTGAGCCTGGTGAGCTGAGGGATGTCG 

GCAAACAGCCCTGGAAGGTTGGTGTGAGGTAGATTGAGTGCCATGTTTCCAG 

AAGGTTCTGAGGCTCTGGGACAGCCTGAGAGCCTGAGAGGCTGGGAGAGGGT 

TGCCTTCCTGTTCATGCTGGCTGATAGGTCTGTCAGTCTTTGCATCTAGGACA 

GCCAGGGCTGGACATGTGCACCCACCGGAATGATAGAGGAGGGGCTTTCCTC 

AGTGGAGGGTGTGTCCTGGAGAGAGCAGGTGGCAGCTGGGAGGCTACAACCC 

TGCGGTCCTGCTCCCTCAGCTGATGGGGGCCCTCCTCACCTCCTTGCTCACAC 

AGAGAGTGGCCACTGGGCTCGGTCAGGCCCCACTTGTATTGTCTGTCCTACCA 

GGCCATCCACCCtTACTCCCCCACCCTCAGCCCTTGGCCTCACCTGAGCCCAC 

ACCCAGCCCAGCCCAGCCCCACCTTACCTCACCTGCGCACCTGGCCTTACTCC 

CTCCTCTCAGTTCCCCACCCATCCCCCTTCAGCTCCTTGGAGCTTGCAGGGAG 

CTTGAAGACAATCCCTTGTCCAGTCTCTCTCCTGCCTCAGTGTCCCCAAGCCA 

ATGGGACCCCATACCCCCAGAGCAGCAGGCTGTGGCACAGCTCAGGGCCCAC: 

GAGACAGCCCTGGGGGCCTCCAAGAAGCCTCCCTAGTGAGTTATTAGAATTTT 

TTTTTTTTAGATGGAGTTTCGCTCTTGTCACCCAGGCTGGAGTGCAGTGGTGC 

GATCTCAGCTCACTGCAGCCTCTGCCTCCCGGGTCCAAGTGATTCTCCTGCCT 

CAGCCTCCTGAGTAGGTGGGATTACAGGTGTGTGCCACCACACCCAGCTAATT 
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TTGGATTTTCAGTAGAGGCGGGGTTTTACCATGTTGACCAGGCTGGTCTCAAA 
CTCCTGACCTCAAGCGATCAGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTAC 
AGGTGTGAGCCACTGCATCCGGGCTAGAATTTTTAAAATATGGAGGCTTGACC 
TGCCTGACCGCCCTGCCTTTTCACTCGTTTTCAGAATAATGAGCAAAGCAACC 

5 TGCAGAAGATCAAATAGGATTTTTCAGTATTATCATGAATGCATGGGCTTTTC 
AGCTATTTGATGTGTTTTCATCCATGACTCTTTCTGATGCTCAAATTGTCCCAT 
TGTTGATCAGCCCCTTCTCAAACTTGACAATGAACAATAGTAGGATATACCTT 
TTTCCTAAGATAAAATAGCTGAGGAGTCTATGCTAATATAGTCCATTCAATTT 
AAGGACTGGTTTGTACTTCTTTGATTTTTTAAATCTGTATCACTTTTATACTGA 

10 AAATCTTGGTTCCAAAGGAATTTAGCATCATTCTTTGATACACACACACACAC 
ACACACACACACACACACACACACACACACAGAGTTCTTTGGGATCCTTTGG 
TTGAGTATTAGTGATCATATTGATCATGGCACTCCATCGTGTGACTGTACCAC 
AGCACAGCTGAAGCGATTGAAGGACATTGTGATTGTCTCTAGACTTTTGCTAA 
TATAAAAAATGCTGTTTTGACrATCATTATGTATATQTACTTTTCTATTTTTCC 

15 TACATATCTTTGGGTTATTTCCTAGAAGTGGGGTTTCTGTGTCAAATGATACA 
TTTAGTAATATCGCTAGGTCTCTTTTCTACCAGTGATGAGGAGTGCACCTGTG 
CCTGCTGCCTCTCCCACAGAGTTGCCATCAAAAGTCTGGATTTTTGCCAATCT 
GATAAGCAAGCAGTGGTATCAGAGTGTACTTTTCATTTGCATTTTTCTTGTTAT 
AAATGAGGTTGAACATTTTTTCATAAGTGCAAAGGTCATGAACCCCATTTCCT 

20 ATGAACTGCTCATGTCTTCTGCTCCCTTTTCTATAGGACTGTCTGGCAGGTTCC 
TCACCTCAGTTTTGATGTTCTTTATGTATCAGTGATATTAATCCTTTGTGATCG 
AAACTGCAAATACTTTTCCCATTTGGTCATTTGTCTCCTTTACTTTGCTAATGG 
TGCTTCTATTTTAATTTTTCTTATTTTATCTGTTGCTTTTGGATTTCAAGTCATA 
GTTGGAAATGTTTTCCCGATTCACAGGTTCTTGAGrGAATTGACCTGTAGATTC 

25 TTGTAGAATTTGCATGGCTTCATCTCACCCTTTACACTTGAGCCTCTGATTCAT 
TTGAGATTTATTCTCGTGTGCGGTGAGATGGACCCAAGGTATGCTTTTTCTTTC 
TTTATGACTGTCCAGTTGTCCAAACGCTGCTTATTCAAAGCTCACCTTCCTCCT 
GCTGAGGTGGGAGGGGAAGGCAGAGACTCCATGCAGGTGTATGTATGTTCAC 
CCGGAAAGTAACT»^'CTGAATGCGTGCTGTGTGCCCTGCAGAGCTGACTGACT 

30 GCCCCCATCTTCTGACCCGCTGCCCTTCTGCACCTTCACTGATATGCTTCGCAG 
GAAGGTACCCTCTCCTCGGCCTCCCTTGTGTTATCACAGCGCCCAGCACAAGC 
TGGGACCCACTCCATCCACTCTTGCTGCAAATACAGACTGTGCACTCCTGTGT 
GGCACCTCACTAGGGTGGGGAGTCAGGCGTCAAGGGCCTGGGCTGGACTCCG 
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TCCACGATAAAAGAGCAGACTTTTGGCAGGGGGTGGTAGATACATCAAATTG 
ACCCAGGGTGCTCAGGGACGACCTGGAAGTGTCACATGAGTTATGGGTCTCC 
TGGCGGGGCGGAGAGTCACCGCTGAGCTCCACCTTTAGTC.CCTGCCTCAGGG 
CTGTGTGCAGTATCCTGAAAAAAGCCGGTCTCTTCTGTTGGCTGCCCCGCACC 
5 CCCCCCCCAAGCCTGAGACGTGCTGAGACCCAAATGCCAGTAGCGGGCGGGA 
GGCAGTGCAGAGACCTGTGAAAATCTTCCGATGTGACCAACCCACCCCCACC 
CAGGGGAATCCACGGGGCGCAGCGGCAGCGCGGTCTCGACGGTGGGGAGGA 
CAAGGCGCCCATTCTCCAAGTTACTAGGGTGGCGCAAGCGCCTCTCCCTTTAA 
CTGCCCAGGCAGCCCCGCCTGCTATGCCCGGCAGTCAAACCAGGTCCAGACC 

10 CCGGCTGTGCCGCCCCCACCCCCCAGCAGCCCTGCCGGCTCCCCACCCACACA 
GCTCCCACTGGGAGGTACCTGACAGTGGTGGTCCCATCTCACCCTCCCGGTCC 
AAGGGCCCTGGTGGGAAGCGACCCGAGCGTATTCGCCGCCCGCAGCTCGCGC 
GCCTGGGCACCCGGGGGCGCTCACAGTGATCGCGGGGCCAGCACACCCTCAC 
CCAGGACATCCCTTTCCTCCCCCCAACCCCAACTCCGGAGTGGCTCAGAGGGA 

1 5 GGGAAAGTAGATGCCGGC ACCCTACCCCGCCCCTCCTGCCCCCGCTGTGCCGT 
TCTCGGTCGTGGTATCGGCCAGGCCTTACCCCTTCCTCCCTGGCGCAGCTGGG 
GTCCTCCTCCGGGCCAGGCAGAGCAGGCGGGCATCAGAAGTGGGGGCCAAGC 
AGGTGGGTGAGGGCAGGGCAGGAGCAAGCAGGGGAGATGCAGACGGGGCGG 
GGCCAAGCAGGTGGGTGAGGGCGGGGCCAAGCAGGTGGGCGGGGAGGGGGC 

20 GGGGCCAGGCGGGGTAAATGCACACTGGAACGGGGCCAAACAGGTGGGCGA 
GGAGGGGGCGGGGCCAAGCGGGATAGATGACACGAGCGGGGCTAAGCAGGT 
GGGCTCGGGCGGGGGTGGGGGTGGGGGCGGGGGCGCAGGCGGGGGCGGGGG 
CGCGGACAGGCCAAGCCAGGGGGTGAGGCGGAGGCAGGGCCAGGCCGGTCC 
GTGAGGGAGAGGGCGGGCCAAGCCGGTGGGCGCGGGCAGGGACGCCCTGTG 

25 CGCGCCGGACCGGCGGGGGCGGGGCGTGCAGGCGGGGGCGGGGCACGCCGT 
CCCATGGGACCGGCCCTCGGCCACTGCCCCCTCCGGCCCCGCCCCGAGCGCCC 
GGGCTGGGCCGGCAGCGGCCCCCCGCGGCGGGGCTGGCAGCAGTGGCTGCCC 
GCACTGCGCCCGGGCGCTCGCCTTCGCTGCAGCTCCCGGTGCCGCCGCTCGGG 
CCGGCCCCCCGGCAGGCCCTCCTCGTTATtiGCCGCGGCCTCCTCCCCGCCCAG 

30 GGCCGAGAGGAAGCGCTGGGGTTGGGGCCGCCTGCCAGGCGCCCGGCGGGG 
CAGCGCGGGCCTGGCCAAGAAGTGCCCCTTCTCGCTGGAGCTGGCGGAGGGC 
GGCCCGGCGGGCGGCGCGCTCTACGCGCCCATCGCGCCCGGCGCCCCAGGTC 
CCGCGCCCCCTGCGTCCCCGGCCGCGCCCGCCGCGCCCCCAGTTGCCTCCGAC 
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CTTGGCCCGCGGCCGCCGGTGAGCCTAGACCCGCGCGTCTCCATCTACAGCAC 
GCGCCGCCCGGTGTTGGCGCGCACCCACGTCCAGGGCCGCGTGTACAACTTCC 
TCGAGCGTCCCACCGGCTGGAAATGCTTCGTTTACCACTTCGCCGTGTGAGTA 
TCGCCACCGGCGACGGCCGGCACGAAGGTGCTTCCTGAGAGCTGGTGTGGGG 

5 GAGCTCTGTCCCAGCGCCACCTGCCCCGTCGGAGCTGCGACCCCGGAGCAGA 
GGAGGGAAGGAAGTGGGGAAACGCAGAAACACAAACTCTGCACTCTCCCTTG 
AAGTTCAGAGGCGCTGCTGTGTCTGGGGGTGCGCATCTTCTCGCAGGCCCGGC 
GTGGGGAGGGAGCCGGCTGGGGAGGGGACCACCTGGAGCCCAGAATTTGGC 
TCCACACCTCCGGGAGGGTAGTCCAGGTGTGAATCCTTCTGGGAAGAGAAGC 

10 GTGCTGGGGAGCGCACCCTTGGGTGCAGTAAGATAACTCCTCAAGGTCGGCT 
GAGACTCGAGGCTCAGGAGCCCCCAAGAGAGAAGGCCCCTGATGTCCGGTGC 
CGCATCTCAGACCCCCTGAGGCCCAGGTGGACTCTGGGGCAGGGGGCTGTCC 
AGGATAGGAAGGTGACGGTGGCGGTGCTCCCTGAGGGCTCAGCATGCCACGG 
GCCGTCCCCACGGGCCCCACATTAACTAATTGAACCAAGCTCATGAAAACCT 

15 GGCTTGATGCAGAGAGCGGAGAGGCACGTTTGCAGCTCTCACTAAGAGGCAG 
CTGTGCTCCCGGAGAAAGCAGCGCTGGTAGCAGAGGCACCTGGCCCCCCTGT 
TACCAGGTGGTTCCAATTCCCGGTACAGCGTGCCTGAGGAGGGCTGGGCACT 
GGATTTCTCAGGGACAGGCCTGGGAAGTCACCTCCGGGAAGGTCCAGGCTGC 
TCTCCTCCATGCCTGCCTGGGGCCTTCCCCCGCTGCCCCAGTGGCCCTACTTCC 

20 TGGCTGCCCAGCCAGCGGCCTTTTGGTGTGGTGCCAGCCTCTGGCCTGGGAGC 
CTCTACCCAGACATCCCATGGCTGATGGCTGTGGGGCTCACCTGAGGGCTGA 
AGGGTGGTCTCCCTGAGCGTCCTCAGGTGGAAGCATCTCGTCTGCCTCGGGCA 
GGCTCAGTAGAGAACTGGCTGGAGGGCATCCAGAGGCCTGTCGATGCCTGCT 

ggcagctgccaccagggcctcagggcgggtGacagcaggagccaggcccca 

25 AATGGCTTCAAGCATCGTCTCAGGTGAGGGGGTGGGGTAGGGGTCGCAGGGC 
TACTGCCTTCCTTGCTAAGAGGTGGCCTTCCACATCAGGAAGGGGAAGTCTTA 

cccacctccctcctcaaagatgtggttggggggtgatcttggagacttttccc 

caccccagctcccaagcccctgtctcctgacatgtcagtgggtgcctgagcca 

cagccgctgctggtctgtgagaggagctggctgtgctcgtggct^caacagc 

30 GGGGGCTCGKjCTTGGGGTTTGGGAGATATTTGTGTGCAGTGACCCCAGGGGA 

acccagtccgatggcactgtgcaaatgtctagcagatgccaggttcacaggt 

gctgtgtgctggtggccacctgcctcccggaccccagactctctgagatgtcc 

aagggtgggaagacctcctcagccagaggccaaggcaaagtgcccgcagac 
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CCCCTCAATTCTCACTTGTATTCAGGTTTGTGGACACATGCCTCCGGGCTCACT 
GCAGCCACCCGTGTGGAGGAAGAGGAGGAAGAGGGCTCGCCACGCCCCAGA 
GGAAGTCTCTGCTTGCACTTGTGTTTTCTTCCTCATTTGGATTGTTTAGGTCTC 
GGAAGTTTGCTCAGCAAGAGTCTACCTTCGCCCAGCCTCCGCAGAGCTGGCA 
AGGCAGGGGTGGCTTCTGGGGACAGGGGCAGGATGGCTTCTGTGAAGGGGTG 
GCCAGGAAAGGGATGCTTCTGTGAAGTGGCCAGATCTGGGGCTGGTCCTTTC 
CAGTTCTGGGTCCTGTCCTTGTAAGACGTGTGCCTGGCCCTGGGAAGTTGTCA 
TGAACAGCCTCCAACCCAGGTACCTCCATGTGGGGTGGGACTGGAGGTGGAC 
CCTGACTGGGCAGGACATGTGGCTTGGTGGGGGCCTGGGAGATGCGCTCCCA 
CCCTCAGTGCCCTAGGAGGTTGGCAGGACCAGTCTCTTGAGGGGAGACCTGG 
TCATAGAACCAGGATGGCAGAGTAGCTGGAGGCCACCTGCAGCCTCACGAAA 
TCAGGTCCAGCCTTGCGCCAGTCCAGCCTTGTGTTGCTGTAAGAAGTCTTGGC 
AATGGTGGCGGAGACAGGGCCCTGTCCTTAGGTGGATTGTGAAATGAGAAAT 
ATTGTGGTTTCTGGCCTGAAGACGACAAGCCTGGCCTTGAAGAGCCAGGGCC 
CAGGCATGTGTGCAGGGGGGTTGAGGCAGGAGCCGGTGTTCCGCTGACCCTA 
CAGGGCTAGGGCGAACTCTCCTGTGTGCGCCTGGGGAAGCCAGCCAGGATAT 
CCTGTCCCCTCCTCGGGGAACCCCCCCTCCCCAGGAGAGCAGCTCTTCCTGTG 
TTCCTTCTGGCCTCCCTGGGCTGGCCGGGGTGGACACGGCACCTGGGCCACAA 
CTCTTGCCCGCTTAGGCTTCCTGCCCCCAACACCCCCAGTTATCTGTGGTTGCT 
TGCGCCCCCACTGGGGGCAGCTCCCATCTTAGCTGAGGGGATGGGGGTCCAT 
ACAGCCCTGCTGCACCCAGCGGTCCTAGCTCCCTTGGTGTGAGCCTTAGCCAG 
GGGTGTGGACATCCAGGTGGCATCTACTCTGGTTGCTGAGATGCTTGGAGATC 
CTGGTTGTGGCCAAGGAGAGGGATTTGAGGGGGGACCTGCAGTGCATGGAGA 
CTGCCCCTGTAGGCTCTCAAACCTAAGTCTGGCTCAGCAGCCCCTGTTCCCCG 
25 CTCCTACAGCCTTGGCTTTCCCTGTTTCCCAAGAACTTTCCAGGGGTGTTGGC 
AGAACCCTCGCGGGTGCGTGTTAATCAACAATGGAGCCAGAAGCAGGTCCCA 
GAGGTGGATGGGGCTGATCTGGCTGCCTCCCCAAAAGACAAGGCCATGCAGA 
GCCAGCTCTCAGGGTGCAGCACCCAACTTTGTGGACTCCAGTGCCCTCCTATC 

tcca;h3ctctggcattggagtctgaagcagctgctgggcagttgttctgcagc 
30 tgggcagggaggccagaggccaaggcaggtggtggggctgggcacagagga 
ggacggggctctctgagccatcaggtctctttctctccatcttctttaagatct 
gtccacccactcctcagccctagattcccaggaaaactgggaattccttttgg. 
ggcttttcccagtgctaccttctgtggggcatgagcccttggtctcatcccctg 
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GCTGTCTGGGGTCCCCTCACCTACATCGCAGACCCGGCCCAGTGTTTCTAGAT 
CCGCTCTTGCCAACATGTCGAGTGCCTAACTGTGGCTTCGTACTAGTTGGGGG 
GAACACAGGCATCCCTTACCCCTGGCCGTCAGCTGGGGAGGGGGTAGTCCAG 
GTGGCAAGAGCCTAGAGCCTGGGAGAAAGGGCCTCCCCAGCCTGTGTGGGGA 
5 CAGGGGATCAGGAGGGTGGTAGGGACTTGGGTCCTTGCAGGTGGCCTGGGCA 
GTGCCTGGTGGGCATGGATCAGGTGGCAGGGCTTTGTGTCTGGGCATGGCCCT 
GTGGCTGGGCTGCCTTGGCCAGGGTCCTGGGCCTTCTGAGTCTGCGTCTCATT 
CTCTGTGGCAGGCTCCTGGTGCTGGCACCTTCAGTGCAGGGTGCCTCGGTTGG 
AGGGAACATTCACCCGTGCCTGGCACTGAAGCCACAGCTCCCTGTGGGCATG 
10 GTGCCCGCAGCTTGAGCCTACAACCCTGGGGTACTGGGGGCAGGAGGGTCCC 
TTTTTGCTTGGCAGTTGCGAGGTACTGGAACCTTTGCATATAACGTGCGGGCT 
GCGTrGTGTCACAGGGCCGGGTCTGAGGGAGGGGCTTTGCGGGGACACTCCT 
GGGTACCGATGGGATGAACTTCCGCTGGGCCTGAAGGCCTGGGATGCTTTTG 
GGCCGGCCAGTGCGGGGTCAGGTGGTTGGAGCCAGGGAGCTGGCACGTCAGC 
15 TGCTGCTATGAGTTCTGCCACGTGCCATCCTGGCATGTGGCACGTCTTGCAGA 
GGACAGGAAGGGTGAACAGCTCCAAACCCACCCTATGTGTGCTCCAGAGCTG 
GAAACAGGCTGTGCGGAGCTGAGTGCCTTTGGGGGCAGCTGAGATGGGCAGG 
AGCACCGTGCCCACCGAGGGTGTCAGAGTGACAGGGGTTGCCAGGCAACTCC 
ACCCGGGTGGCTGATGCCCCCGTTTTTAAAGGCGTTGGGTGCCGAGAATTCCC 
20 CTCCTGCTGCTAGGCTGGTTTCAAGTCTGCTCCTTCTGCACTTGACGTCCTTGC 
CAGTGTGATCCAGAGCTGCTCACACGAGCAAGCGTCCTCGAGTCCTAGGGAT 
GCTGTTAACAGGGCAGCGGGTCCAGGGAGCTGGTCCTGACGCAGGGGTCCTG 
ATGCAGCGGGTCCAGGGAGCTCATGCCTTCCAGCGATGGGAATGCCACCGGT 
CCCTGTAGCCAAGGTGCAGATGCACACGTCCTGGTCCTGAGCCCCTGCTCGGC 
25 CCTAGGCAGGGCCCCCAGTTCCCTGCATTCCACCCGCCTTGTCTGGGAGGTGG 
ACGAGCCCTCCGTAGCCCTGACATTCCAAACTGGCTTTTGGCCCCTGCGATAT 
CTTGCTGTGATTCAAGACTCTGTCCACGGGCAAGAACAACAAGGCTGGGACA 
ATCTCATCTCAGGCATCTGTGGGAGGAGACAGCTCCCAAGGGCGGTGATGCC 
AGGAGAACATTCCAGGCCCAGGAAGAGCTGAGAGACAGCAGATGCCCAAGA 
30 CGAGTTGCTGGTGTGTGGCCAGTGCTGGGAGAGCATGGTGTCGGCCCGGGGA 
GATGCCGCAGAGAAGCTTCCAGAAGCCCAACATCTGCTGCTGGAGGGGAGAA 
TGAGGCAATGAATATCACCATCCTGGGCCACTTTGACCCAGACCCCAAGGAT 
TTGGTTACAGAGGGGAGGGCGGCAGCGTCTGGCTTCACTCTCGGGAGGTGCC 
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TTGGGGCTGGCCAGTATGGGATTGCTGTCACTGTGGGTCACTCCAGCCCCGGA 

TGCTCCTACACCATGCTCTGCTTGCATCTGGGGTCCCTGCGGGCACGTCGGTG 

AGACCGTCCTGGCGTCCACACCCCTTCCTGGTGGGTAGCTGTCTCTCTCCAGC 

AGCTCCTCCAACCACCATGAGAAACCCCAGACCTCACAGATGCAAAATTATC 

GTGGTTGACAAAGGGAGGCGGCTGGTGTCTCTGGTACTGGCTGGGTGACCAC 

AGGGATGCCGCGTGACCCTAGTCCAGGGGGCCGCTTTTACACGAGGCATCTG 

ACCATGTGATAGGGTGTGGCTTCTGCTGCCTGGCCCAGCCCTCTTCTGGGAGC 

CCCACGTGGGTGGGTGCTGGCTGGGGAGGGGCTTCTCTCTGGGGAGCAGGCC 

CTGGCTGGAGGTGGGGAAGATCCATGATGGGACCCAGGTGTCTTCCCACTTCT 

CGACCATCTCCTGGGAAGTTCTAATGTTTGGTTCAGGGTGTGGGGTCCATCTG 

AGCAGGCATGGTGCTTCCAGAAGGATCCAGGTTAAGGGTGGTGGGTCCAAGG 

GGGTGCCTGAGCCCTTCTGCAGAAACACCGTTCCCCGTAGAGCAGCAGTCCC 

CAACCTTTTTAGTACCAGAGACCTGTTTTATGGAAGAAAGATTTTTCCACCCG 

GGCAGTGGTGATCTTGGGATGAAACTGTTCCATCTCAGATCATCAGGTATTAG 

ATTCTCATAAGAAGCGTGCAACCTAGATCCCTTGTGTGCGCAGTTCACAATAG 

GATTTGTGCTCCTGTGAGAATCTAATGCCACCGCTGATCCAACAGGAGGTAG 

AGCTCAGGTGGCCGTGCACATTCACCTGACGCTCACTCACTGCTGTGCGGCCT 

GCTCCCTAACAGGCCACGGACCCGGGGTTGGAGAACCCTGTCTTAGAGGATT 

GAGGCTCGGGGCCATGGGATCGGCACTGTCATTGCCCTTGGAGGGTTCTGAG 

CACTGGAAGGACCTGGCTGTGGTCCCAGGCACTGGGTGGATGGACCAGCAGA 

AAGGCTCCCAGGAGGGTCGTGGCTCCCTCATCGGCACCGGACTCTCAGGATG 

AGCCGCCTGGAGTCTGTTGGCATCTGCCTGGCCGCCTCTGGCAGGAACTTCTC 

CTGATGGAAGAGCCGGGCTGGGGAGCTGACTCCAGGACAGGCCCCTGCCCGC 

TCTGGGACCTGGGGCCTGGGCTCTGCTCCCCATTTCGCTCCTCCACTCACAGA 

TGAGGACATTTGGGTCGCCTTACAGATGAGACAACTAGGCCTGGCCACTTTGC 

TCATGCCACACCCAGAAAGCCCTCAGGACACCCAGAAGCCCTTAGGATGCCC 

AGAAGCCCTTAGGATGCTGTGGTCTCAAGTGAGGTGGTGCACTATTCCTGGCC 

TCAGGCCCAGTTTGTATCCATGGGCACCCCCACTCCCACCTAGCACTTGCCAC 

caggaagcaccctgaggactgtcccctttgaga^ttggctctcaaagccttgg 
tctgtgcctgcgtcctggcagcgggtgagctacggctgccctcacccagaga 

GTGCAGGTTCCCTGGGGGCGGGGGCTCTTAGCTTTCCCTCCATATCTGCTAAG 
GTGCACACCTTTCTCTGCTGCCTTCCAAAATCATACTCTGGGGCGCACTGAAA 
GGGTGCTCTGAGTAGCTCTCCCCGGCTCCTGGCTGAACTTGCTCCAGGCAGAG 



WO 02/054069 



192 



PCT/US01/49808 



ATTTGAAGAGATTCAGAAAGCAGGAACCACAGGCAGAGCAGCTGGACAGGG 
ACCCCCACCCCTGTACTCCCTTCGGCCCTGGAATGCTCCTCTGCATGGAGACA 
GCCCAGCTGCACTGGGACCCCTCACCCCACCCTGGGGCCAGGGGCTGTGTCCT 
CGACTTCCACGCAGAGCAGCCCGGGCAGCTTAGAGGAGGGAGCAGAAGGAG 
5 CTCTGACTGAGAGGGGTAGACCCCTCCCAGAGGCATCCCCACCCTGCCCCAA 
AGAGGGGAGCAGCTGGAGACGGGGCGGGAACCTCCTCTGGCTATATCTAAGC 
AGCCCCGGGAACCATGCGCTTCAAGCGTTTGATCTTGACGGGCCCTGGGCTTG 
GCATTCTATTCCGTGACGTCTCCAGCTGTTCTCTTGTGAAGAGTCACGTCTGTG 
CTCTTGGCTAGGATGGCTCAAATATTATAAAAGATACAGCTTTGCTAATTAAA 
10 ACAGCATGACATTGCCTTATAGGCAGGCAGATCACCAATGGAACAGCTTTTT 
AAAAACCGCATGTAACTCCATTGTATATGAAGTTTCGTAAAAGAGGCAGCAT 
CTCCAACCAGTGGGGAACAGATGGGCGGCTTCATGCGTGCTGCGGGGTGGGT 
GGGTGGCCATGTGGAAAGTGACCAGACTGCACCCACTTGTCACATCACACAC 
CAGGATAGACTCCCAGTGGGTCAGAAAACCACTTGGGTACCTAAAGAAAGCG 
15 CCTGGAATTCTTTACCCTGAGAGCGAAGAAACTTTTCTATGACTTGAATGCTA 
GATGCGTGGAGGAGATTTATGTGTTTGACCACATAAGAAAGAAGCAACCTTG 
GCTGGGTGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGC 
GGGCGGATCACCTTAGGTCGGGAGTTCGAGACCAGCCTGACCAACGTGGTGA 
AACGCTGTCTCTACTAAAAATACAGAATTAGCCGGGCGTGGTGGTGCATGCCT 
20 GTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCCGGGA 
GGTGGAGGTTGCAGTGAGCCGAGATCACGCCGTTGCACTCCAGCCTGGGCAA 
TAAGAGTGAAATTCCATCTCAGGGAAAAAAAAAAATCTTAATGGGAAACAGC 
CAATTGCCAGGCTGGGGGAAGATGTGATGTTTCACGAAGGGTTAATCTGTTTA 
CTACAGAAAGAGTTTCTGAAAACAGAAGAAAACCCACAGCCAGCAGGAGAT 
25 GTGAATACCACTCACAGGACAGTGACCCACAGGGGGACCTTGAACATGTGAG 
AAGGTGCCGCCTTCACTCCTGGCGAGAGGGGTGTCGAGGAAAATGTCACTGA 
GGTCCCTTCTGGCTGGCAGAAATCCTTTGGCATTTGCTGGTAGGCATGCAAGA 
GGGTAGACTCTGTGAGGAAGGGATTTGGCAACATCTAGCAAAACTGTCTTCA 
CTGACCTGTGACCCAGCAACCCCCTCCCCAGAACCTACCTGAAGGCACT^C 
30 AAAGGCAGGAGAAGCTGCGTGTGCAGGGCTGCTTGTGAAAGTACTGTTTGGA 
AAACTGCAAATGCCCATTGTAGGAGACTGGTTGAATCAGAGATTTGTGGAAT 
ATTATGCAGCTGTAAAAAAGGAATGCATAGATCTCTCTTATTATGAAGTGATT 
ATCAGGATTTATTTAATTATTTTTTTCAGGCAGGGTCTTGCTGTGTTGGCAAGG 
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GTGGTCTTGAAATCTTGGCCTCAAGCAGTCCTCCCACCTCAGCCTCCCAAAGT 

GCTTAGATTACAGGTATGAGCCACCACACCTGGCGTATCAGGATGCTTTTTAA 

GTGAAAAAGCACAGTGAATAAAAATATAGCAGCTATTCTTCATCTGATAAAG 

GGGAGGATGCAAGTCTGGATGTATATTAAAAACGAAAGCAATGGAAGCCTCC 

AAATCTTAAAAGTTTACCTCAAAAGGGTGGCGGGGGGGGAGGTGGCATATGG 

AATCAAGATGCCTCTGAATGTACCTGGTTITATAGATTrCACTTTGAAGCCAT 

GTAAATCAATAGCTTGCGTAATTTTTAAAAAGCTGTTCGTAATCGCTTGGTGA 

AAAATCCCTAAATAAATCHTAACCAGAAAAAGTAGTCCCTCAAAATTGAAAT 

GAA.ACAAATGAGCCTAAAAATGTGTGCTGAATTAGTGGCTTAACCACCCAGA 

AGGACTGATTTCAGGTGACATATTTCTAGCGGGTTACTGCTGAAGACAAATA 

GAGCTGCATGCAGTGGCCACATAATTGTTGTGATACTATTTGTTATATTATTCT 

AAGAGTGCTGCATGGGAATTG.CAGGTTAAGGTAAGTAATTGTGTTGGTATCA 

CCAAGAATGGAGGTTT^ACAAAATATGATTCACATGTAAACAGAATTAAAAA 

CAAAAACCGTATGATCATCTCAATAGATGTAGAAAAAAACTTTTGATAAAAT 

TCATCATCCCTTCATTAAAAAACCCTCAACAAAAATAGGCATCAAAGGAACA. 

TATCTCAATAAGAGCCATCTATGACAAACCCATGGTAAATGTGGTACTGAAT 

GGGTAAAAGCGAGACACATTCCCCTTAAGAATAGGAACAAGACAAGGATGC 

CCACTTTCACCACTCCTATTCAAAATAATACTGGAGGTCCTAGTCAGAAAAAT 

CAGGCAAGAGAAAGAAATAAAAGGGGATCCAAATAGGAAAAGAGGAACTCA 

AATCATCTCCCTTCACCGATGATGATAAGATTCTATACCTAGAAAACCCTAAA 

GAGTCTGCCAAAAGCCTTCTGGAACTGTAAGTGAAGTCTCAGGATACAAAAT 

CCATGTACAAAAATGAGCAGCTTTrrrrill 11 1 1 TGAGATGGAGTCTCACTCT 

GTCACCCAGGCTGGAGTGCAGTGATGAGATCTTGGCTCACTGCAAGCTCTGCC 

TCCCAGGTTCACACCATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGACAACA 

GGCGTCTGCCACCACGCCCGGCTAATTTTTTGTAT^TTTTAGTAGAGACGGGG 

TTTCACCGTGTTAGCCAGGATGGTCTTGATCTCCTGACCTTGTGATCCACCCG 

CCTTGGCCTCCCAAAGTGCTGGGATTCAGGCTTGAGCCACCGTGCCTGGCCAA 

TCAGCAGCATTCTTATACACCAATGATGTTCAAACTCAGAGCCAAATCAAGA 

ATGCAATTCC £ TTT ATAAT AGACAC AAAAAGAATAAAATACCC AGGAAT AC A 

TCTAACCAAGGAGGTGAAAGATCTCTACACAAGAGTTACAAAACACTGCTGA 

AAGAAATCATAGATGACAGAAATGGAAAAACATTCCATGCTGATGGGTTGGA 

AGAATCAATATTGTTAAAATGGCCATACGGCCCAAAGCAATCTACAGATTCA 

ATGCTATTCCTATCAAACTACCAATGCCACTITTCATAGAATTAGAAAAAACT 
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ATTCTAACATTCATGTAGAACCAAAAAAGAGCCGAAATAGCAAAAAAGGAA 
CTAAGCCAGAGGCATCACATTACCTGACTTCAAACTATACTTCAAGGCTACAG 
TAGCCAAAACAGCATGGTATTGGTACATAGACATATAGACCAATGGAACAGA 
ATAGAGAACTCAGAAATAAAGCCACACACCTACAGCCATCTGATCATCAATA 

5 aaatcaacaaaaatatck:catggggaaaagacttcatattcaataAtggcac 

AGGGATAACTGGCTCCCCATATGCAGAGGAATGAAGTCAGACTCCTATCTAT 

caccatatacaaaaattaaagacagattagacgtggccaggcacagtggctc 

ATGCCTGTAATCCTAGCACTTTGGGAGGCCAAGGTGGGTGGATCATGGGGTC 

aggagtttgagaccagcttggccaaggtggtgaaaccccatctctactaaaa 
io atacaaaaattagctgggtgtggtggtacgcacctgtggtcccagctacccg 
ggaagctgaggcaggagattgcaccactgcgctccagcctgggtgacaagag 

C AAAACTCT ATCTC AAAA C AAAAAAAAAAAAATT AAAAAA G ATT AGATGTAA 
ACATTAGATTACAAACTGAAAAAATCCTGGAAGAAAATAGTCTTCTCAATAT 

tggccttggcaaataatatatagctaataagtcctcaaaagcaattgcaaca 
15 aaactaaaaattcacaagtgggacctaattaaagagctgctgcacagcaaga 
gagactgccaagggaataaacagataacctacagaatgggagaaaatatta 
gcaaacittgaactgacaaaggcctaatattcaaagtctacaaagaattaac 
aaatcaacaagcaaaaaaacaaccccattaaaaagtgggcaaaagggccgg 
gcgcggtggctcacgcctgtaatcccagcacgttgggaggccaaggtgggca 
20 gatcaccaggtcaggagatcgagaccatcctggctaacatggtgaaaccctg 
tctctactaaaaatacaaaaaaaaaaaaaaaaaaaaaaaattagccgggcg 
tgggggtgggcgcctgtagtcccagctactcgggaggctgaggcaggagaat 
ggcgtgaacccaggagacggagcttgcagtaagccgagatcacgccactgca 
ctccagcctgggtgacagagcaagactcggtctcaaaaaaaaaaaaaaaaa 
25 aaaaaaaaaacacaaaaaaacaaaaagtgggcaaaagatatggtagacagt 
tcttaaaaagaagacatacaaacagtcaaacacatgaaaaaatgctcatcac 
tgatcatcagagaaatgcaaatcaaagtcacaatgagataccatctcacgcc 

AGTCAGAATGGCCnTTGTTTAAAAAGTCTCAAAACAACAGATGCTGGAGAGG 

cagtagagaaaggggaacacttac^ccctgttgacgggagtgtaaatgagtc 
30 cagccaccatggaaggcagtttggagatttcccaaagaaccaagagttgaac 
taccattcgatccagcagtcccgtgtgctgagtttatacccaaagaaaaatta 
gtcgttctaccaaaaagacatatgcacctaaacattcatggcagaaagacat 
gagtcaactca.ggtgcctgtcaatggtagatcgaataaagcaaatgtgatat 
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ATATACACCATGGAATACTATGCAGACGTGAAAATCATGTCCTTTGCAGCAA 

CATGGATGC AGCTGGAGGTCGTTATCCTAAGTGAATTAATGCAGAAACTGAA 

AACCAAATACTGCGTGGTCTCATGGATTATAAGTGAGGGCTAAACATCAAAT 

CCATGTGGACATAATGATGAGAACAAGAGACACTGGGGAATACAAGAGTGG 

GGAGGGGGGAAGGGGAGCAAGGTTTGAAAAGTTACCTGTGGGTGCTGTGCGC 

ACTACGTGGGAGACCGGATCGTTTGTACTCCAAACCTCAGCATCATGCAATAT 

ACCTTTGTGACAAACCTGCACTTGTACCTCCTGAACCTATTAAAAAAGTTGAA 

TAAAATGGAAACTTTTGGCATGGGAGAAAAAAAAAAAAAACAGATGTAAGT 

TAGAAGAGGTTACGCAAAAATCCTCCATCCCTGCATTTGAATGAGGACTATC 

AGTATGCATCCAGGATGACGTTTACTTAAATATCTCTGTTTCCTGGAAAGGCC 

CAGGACCAAGGACCACTCAGCAGCTAAGAGCATCTCTAGACCCCAGACTGTG 

GCCTCTAGGTACCATTTCCCCCTGAAAGAAACCAGGGCCCCTGGCGGATTCCA 

GGTCTGTGGCAGAAGATGTGCAAGCTAAGCCTGGAGTATTCATCACAGCAGA 

AGGC AGGG AAGTTATC AG AGGCCCCCT AGGGCC ATGTCACTGGAACTTGAAG 

GGGTCCTTGTCTTCCCCCTTCTCCCCAAGCTGAAGATGCAGTGGGTAAGCCTC 

ACTGAAAATAACAGAGGAAGTGGATGAGCTGCATTGATGGGGTTCAGATCCC 

AAGTTCATCCTGATACTGAGGGGCAAAACCAGTGGACTTTGATTTGCATTTCT 

CTAAAGATCTGTGGAGGTCAATAGGGAGACCGCTCATGATGTTGAATCCTGG 

CAAGCAAACGGAGAGAATCTGACATGTGTCCCAGTCGTGATTAACCTGTGAA 

CTGTAACTCGGGGAAACCCAGTAGCTGATGGGGTTGCGTCCTTCTTTGGGACT 

CCCGCAGATGCCTGAGGCGGCAGTGCTGCTGAAGTCCTGCCAGGCTCCAGCT 

CCTGGTGAGGTCAGGGATTCCGCATGCAGCGTCACAGCTGCCAACCTCTCCA 

AAGCAGTAAAGGCCTTCGGACAGTGCACAGAAAGTGCTCCCATCCACAATGC 

TTCCTTGCCTGGATGGATGGATGGATGGATGGATGGATGGATGGATGGATGG 

ATGGATGGATCGATCGATCTCACCAAGCCTCGTGCTCTAAGTACAAGTTTACT 

ggaaatactcgggccagaggaacgcgctaaatgatactacaggaggccactt 
gcagaatcccagtggtgggggatggctctgtaggacacatgacctgatttcc • 
tcagtaagtcaatgaccaagaaataacggagagaaggaggaaagaatcacc 
tatccaccaattgtgacatatggccttagttaggtcctca/ : :taaatgatgaca 

AATATTGGATTAAATTACTTCTGGTTTGCTGCATGTTCCATTTTTGCCCATGCC 
GCACCTGCCATGGTGCATTTCTGATTCCCGTGCCCAGTTGGGAGGCTTTGGGG 
TTGTTGGGATGCTTTTGTGAATGGCTTCCTTCCTTCCCAGGCACAGGGGCTTTA 
TGGGGCTCTGGAGCAGACAGACCAGAGTTTGCACCCAGCTCCCAGCTGGCCA 
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TGTTACCTCAACTGCAAAACGAGAGACGGACTTCCTTGGGACGCGGTGGTGG 
GGACACTGTGGTCTGTCTCCTCTGGGCGCCCCGTGGCGCCCGGTGGTGTGAGG 
TTAGGATGTGGGGCGTGGTGTCCAGGGACTGTGTGGCCTGCCTGGAGCTGGG 
CAGAGACAAAGCATGGATGTGACACTCCCTGACTTCAGGCCATAACAGGGGC 

5 AGGGGTGGGGGGCAATCAGGGTTGCTAGAGGACAACCCCATGTAGTTCCTCA 
CTGCAGGGCTTGGAGGGAGCAGGTGAAGGCTTACCTGTCTTAGGTAATGAGG 
AGCTGTGCGGGTCTGAGACAGCTCTGGTTCCTTTAAGGGTATGTAGGGGTCAC 
TGCCAGTCTGTAGCTCCCTCTGTGGGGGCAGGCTGTGGCACTGTGGCGACTGG 
CAGTACTGACTTCCTCTAAGCCAACCGGGCAGGGACAAGTCAGCTTGTTGTTC 
10 CGTCCACAGTAGAGGGCGGCCCTGGGGGTGAACGGCTGAGCGTGAGGGCGTG 
GTCCCTCCCCCCAGACCACCCCGGAGGCCCCCTCCGCCCTCCTGCCTACAAAG 
AGTTCCCCCTTTGTGTTGAGAGGCCTGGGTCTCAGAGGTAGATGAATGGAGTC 
GTGAAATCAGGCAGGGATTCCCTTCTGCTTAAAACCCACTCTCCTTCCCATGC 
AGCACAGGTGGCCAAAACCTGCGGCTGGGTCAGAGGACTGGTTTAGTAAGAA 

15 11111 1 1 1 1 TTTrAAATGTGGTTATCTGCTAGCCAGAAAAGGTGTGTGGTTTCG 
CAGTCTTrCAGACAAACTGAATTTAATAGTAGCrGACGfGAACCCCTGTTTCT 
CCTCTAGGCTCCAGAGCCGCACCCCAGTGCAGGTGTGGAAGCTCAGTTCCTCC 
AAAACCCAGACAGGCCTTGGTGTCCTGACTGCCACCCCCAGGGGGCCCCTCA 
CTCGAGTCCTTGCTTGCAGTGCCCATGGGCAGCAGAGGTGTCCGGAGATGCA 

20 ACAGGGCCCAGACCCCTACCAGCCTGTGGCCCAGCTGGGGGCAGGACTCCGG 
GGCAGGGTGAGTCCCTGCGTGACCTGATTCTCTGTGGGCAGGCCGGGCAGGG 
TGGCTGCAGGGCGGCGGAGCCTGTGCTGCTGGCCTGCATTTCTCTGGCTGGAC 
AGATACTGCGCCCATATCCCTGGTCTTTCAAAGGGCTCTGTGGCTGGTGGAGG 
GCAGCAATGCCCCAAGGGAGGAGCAGGTGCCCAAGCCCCCAGGTGGCCCTGC 

25 GGGTACAGGTACTTGTTTACTCTGTGGCATCAGTCCTGCTTCCCAGCTCAAGG 
CCCTGGCCTCCACAGGCCTCCTCCCTCCCTCCCTCCCTCCCTCCTTCCCnTITrT 
CCTGTGGCTCCAAGGCCCACAGTAGAAAGGCTCCAGTCTGTGTCCATGTGTTT 
TGGAGGCCTCTCAGGTACAGGGACTGTCAGCCCAGCTGTAAGTGGACCTTGG 
- ACCGCAAATTTGCATGGGCCCCTTTGCCAAAGCCCTGTGCCQACTGCTGGGGA 

30 ACGCTGGGGACTCCGCTCTTGTGGGGCAGTGGGGCTGGGTGGGAGCAACCCC 
CCATCCGGGTGGCTGTGGCCGAGGCCCTTATGAGAGGGTGGTAGGTGTGTCT 
CCACAGTGGGGGACCAGATCTCTCTCTGGCCCCAAAGACTGCTTCAAGCCTCT 
GGAGGAGCACTTTGCCCCAGGCTGGGGCGTGACTGCTTCTGGTTGAGGGTTG 
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CTGCCTGTGCAGGTGGCCCTCTTGTCGGTCTTGTCTGCTGCATGGGGGTGTGG 

CCTCCTGTGGATGGCCCCTGTCCTGGGTGGTACTGCTGCAGGACCTGGGTTTC 

CGGGGGTGTCCGGGAGTCGGATGGGCCTCACTCTCCTGGCCTCCACTGGGTCC 

AGGAGTGGCGATCTGTGAGCCAATGGCCGTGGCTGGGGGCAGCCTGGGGTGT 

TGAGAGGCCCTGGGCAGCCACCTTGCCTCTCTGAGTTTGTCCTCATGTGTCTC 

TGGGGCTGGAACTCGGTGTCCGACCTGGTGATGGGTCAGGTGTGATCAGGTG 

TGTGGGGCTCGGTCTGCATGTGCAGGGCTGTTTGAAGGCCTGTGGGGCCATCT 

GATGCCCCCCTTTTCTTCAGCCTTGGAGAGGATGGCTTCTCAGGCCGCCTCGC 

AGCAGATTTGTAGTCTGGTTGCTGTCGTTCCTGAAATAGCCTCTGAAGCTCTG 

TCCGGGAAGGGAACCTTGAGTGTGGAGGAGATAAGCCTGCTGACTGGGTGAG 

CCGGCCGGGGCGGGGGTGGCCCCTCTTCCCTTCCTCCCCTGCAGCTTCCATGG 

CCTGGGGCTGTGAGAGGCCCGGGAAGGCACTGTCTTTGCGCCTGCACATGTG 

TGTGTCTGGAGTGTAGGATGGCACTGGTGCCGGGCCTGGATTTACTCAGCCCA 

GATCACGGCTGCTTTTGTTTACGTGGCCCCTGCTCTCACCCACAACTCCAGGT 

TTCTGGCTCTCGGGAATTTGAGGCCTGTGGCTGCTGTGGACCCTGGGAAAGAG 

CCTGTGCTTCCTGAGCCAGTGCGGGGCCTGGCATGGAGTAGGTACCCCGGGG 

GTGGACAGATAGGCAGAGGAAGGGATGGGCAGGTGGATGGGGGGCTGAATT 

GTGGTCAGTTATGGGTGGCGGGTAGATGGGTGGACAGCTGGATAGATGAAGT 

ACTGGGTGGGTGGAGAGAGAAAGGGGTGGGTGGACGGAGGGATGGGCAGGG 

AGAGAAATGGATGGGGCATGCTGCTTCCAGCACCCACTGGGTGCAGCTGTCT 

GCTGTCTGCTGTCCTGGGGGAAGGTCTGGAGAGGGCCTCTCGGTGAGTGTGG 

ACTCAGCCAGCCTAGTCCCAATACAGCTGGGATGCATTGCTGCTCCTTCCGCC 

ATCCCAGCAGCTGTCCAGAGATGAGACCCAGCCCCACTGTGTCTTTCTGGGAT 

TCACAAGAATCCTTCCCTGGGCTGAGGGGGGGTTGCTGTGGGTGTATCTCATG 

GAGAGCCCCAAGGAGCCAGGGAGGAGCCTTCTGGTGGTTTGGGTTCTGGATT 

GGTGGGTGCTGGGTGGGTCTTCATGGCTGTTCTTGGGGCTGCCAGGCCTGCAA 

GGTAGACAGGCCTCTGGACTTGAGTGTGGCTGTGGACGAGGGCAGCGTCGCC 

ATCAGAGGCGATGTCTAGGGCCACCCCCTCTGACTTGCCTCCTCCTCCTTCTG 

ACTTGCCTCCTCTCTCTHACGGGCCTGCTCCTGAGCTTGACACCTGCCTGGGG 

GCTCCCGGCGAAGGCCGCTGGTTTCTAGAATGCACCATCTCTTCCTGGCATGA 

CGGGAACCACCTGTGACATTGCCACCACCTCGCTGTAATCTGGGCAGCAGCT 

GTCATGATCCCACCATGTGCCCCCCGTGAGGCCCCCACCTGTTACTGAGTGGC 

AGGGACGTGCTCCCACACCCCCATGCGCCATCCTGCAGGTGCTTTCTGCTGAC 
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TTCTGTGTGCCCTGGGGCCTGCTCTCTTGGTAGGGGTTGACCCTGCCTGTGAC 
TTAGACAGCTTGGAGGGCCAGGACTGAGGGGAGGGGGCGGTGGAGTGAAGG 
GGAGGGTGGGCGCTGCTGGCAAGAACCACAGGCAAAGAGGGTGCCGGGGGC 
• AGGGAAGGCCTGGAGGTTTGAGGAGCAGAAAGTAGACCCTTGACCCTCCGTG 
5 GGCTGGTGAGTCGGGGCCCAGCCCAGAGTGACAGGGACCAAACCTGGGTCCA 
TGTGCCGTGCCCAGCCTGGGGTTCAGGTTTCTTCCCCGTGAGCTGAGCAGACA 
GGGAGGGTCTTGGGGGAAGGCTGTGGGCCCTTGGGTGGAAGAGTCTTGGGTG 
AGGGCCCTGAACTGGTAAGCGGGGCAGCGGCGGCAGGGGGCCCAGGGAAGT 
GGGGCCAGTCGGGGGTCCTCAGGGGTCCTTCAGGGATATACCTGCTGTCAGG 
10 GTGTGGGGAGTGGGAAGTGGGGGACGGGGTGGATTCCAGGATTCCGGGTTGT 
GCTTGGTCAGAGTGGGGAACTGGACGCTCCGTCCCTGGCTCAGCCTCTCCCGG 
CTGTGACCTTGGGGGACCATTGACTTTACTGTGTGCCTGGAGAGCCTAATCCC 
TACCTGCCAGTGGGTGACACAGAAGGCAGGAATGCAGAAGGCCTTTCAGAAG 
TTCTCACTGGCCTGCAAGGTAGAGGCTGTTTTCTGCTGAGGATACATTTGCCC 
15 CCTCTATCCCCCAGATCGGCGGCTGCTCAAGGAGCCTGGTACAGCTGCACGG 
AGGCGCAGCACCCACAGGACAAGTGGTGGAATGTTCTGGTTGTCCTTGGTGC 
AGGTGGCGGGCGGGGCTGGGGGGCTCTGTAGCCTTCCTGGCTTCGGTCCCCTG 
GACAGGCTCCACCTCCCTCTTATCGTGGCCCCTTTGGCAGGCTGCCTGCCACC 
TCAAAGTCACGCTGCCCTGGGCACCCTGCCTCTTCCGGGGACTGGGGCTGGG 
20 GCTGGGGCAGCTGTGTTTATGGGTGTACTCCCTGTGCTGGGCACTGCGCTGAG 
CTCAACACACAGGGGCTCGGGGAGGTCTGTGGGTGCCCAGGCCGAGATGTGA 
ACCCrGAGTTTGTGCAACTCGAGTTTCAGAGTGGCGGCCTCTGCTCCTCACAA 
GACATTGCCCTGCGAGGGGGTCAGCCCTGAAGCCGGATGGCCCGGCCCCCGC 
TACCACGTGGAGGCTCCCTGTAGGTGCTTGTGTAGATGCCCCCGTGCGGGGAC 
25 TTGTTTGGCTGATGGATCAGGGGGAAGGTTCTCCCCACGGTGTGAGGCAGCA 
CCGAGGGCTCCGTGCCCAGCAGGCTCACTGTCGGCAGTTGGGTCTGGTTTGAT 
AACCGTGGACCGGGGTGACAGGCCCTGACTCTGCAGAGCAGGACTGTGGAAA 
ATGGACACTGATGCTGCCCGGTGGATCCAGGCAGGGCCGGGATGTTTGCAGG 
ACCCACGGGACAATTAGAACGACTGGGCCTnACTGGCACAGGGATGTGGACT 
30 ACGGTGCTTGCCATCAGCAGATACAAGCTGTACGCAGTGGGCCGCAGGCGCT 
CCCTGGGCCGGAACACATGGACGTCCAGGTGTGGAATGGCCCGGACAGCAGA 
TAACAAGCCTTCGTCGTGGTCACCCCTGGGTGTGGGCTGTGGGTTTTAATCTT 
TTCATTTTTGCTCATCTGAATTTTCTAATTAAAAATATATTGCTTTTTAAAAAT 
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AATAAAAGATCATTTAAATTAAAATATATAAGTGAGGAGAGGCTGGTAAGAG 

GATTCTGGAGTCCCTGGAGTGTCTTCTGGCATCGCATGGATGAGAACTCTGAA 

TGAATTGAGTGTGATTTGACCTAGGAGAGTGCCGGGGTGGGGGCAGGTGCAC 

TGTGGTCCTGGGTCCACATGGTGCTAGGGTCCCATGGGCTCCTGGCCCCTGGC 

TGCTGCCCCCACATTGGGCCCTGCACCAGGCATGAGTCCCATGTCCTGTCCTT 

GGGGAGGCCTGGGGGACAGGAGTTGGGGGGGTGGGCAGTGCCTCTGTGTGGC 

AGATTCCAGCTTCTCTCAGGGGCCAGGATTACATGGTCCGTGTTGGACTCTGG 

GATGCTGGTGGGAAGAACAGTCTGGGCGAGGAGGAAAGGGTGGCTCTGCTG 

ACCCCTCCTAGGTGGGCTGGGGCTGAGGCGGATCCCACTGTGGGGAGCTGCC 

CAGAGATGGAAGGAGCTGGAGTCCACAGGGCGCTTCTCTTCTCTTTTCCTGTG 

TGTTAAGGTAGTTCAAAGTCCTCCTGCCCGAGGAAGTAAGACAGCAATCTCT 

GGGGTCCTGGGCACCAACTCTGGGCACTGATGGGCTGGTCCAGTGCAGGGAG 

CTGCCCCGGCCGCCCCCGTGGGTGCAGGCCATGGGCTCTTGTTTGCTTTCCCC 

CATCTGTAAAATGGGTAACGGCCGCTCCTTCAGGGGCTGTTAGCCACATGCTT 

AGACCGCACCGAGAAAATGCAGTCCCTCCAGAGCTCCTAACCTTCCATGGGC 

CACAGGCCCCTTCCCAGAAGTTTTTTTATTTTTTATTTTTTTTTGAA 

CTCACTCTGTCGCCTAGGTTAGAGTGCAGTGGCACAGTCATCACTCACTGCAG 

TGCGACCTCCTGGGCTCAAGCTGGCACCGTGACCCCTTCTCCAAGGCCTGCCA 

GGCTGGCCTCCAGTCCAGGTGTGAGGCCCTGAGGCCCTGGCTGCAGCTGGAA 

GCTCTTGGGCAGTGCACTGTGCCCACAGCGGACTGCCTCCTGCAGCCAGGTGG 

CCTTAGGTGATCCTCCAAGGTCCAGCGAGCCAGGATCTGAGTCAGGAGGAAC 

CACGGCCCCCACCAGGGCTGAGACTCCTCTGAGCTCGTCACTTGTCTTGGTTG 

GGTAGAGGCCTGAGAGGGTGGACACTTGCCCAGGGTCACACAGCTTGTGCGT 

AGCAGAGCTCCCTGTCCGGCCCCTGCCCCGTGCTGGCCCCTCTCACACCTTGC 

TCACGCTAGGTGCCTTGTACCAGTGCCTGCTTGGGGGTGGAATTGGAGAGGG 

GAGGGGTGGGCGCCGAGCTCTGCCTCCCACCAGCATGCTTCCCAGAGCTTCCT 

GGGAGAAGGTTTACGGAGGAGTGGCCTTGTCTGGTGTCCCTCTGGGTAGAGC 

CTGTGTTCCGGGCAGGCTGGGAACAATGAATGGTACAGATCCACGGCTCCCC 

TTGGGCCTTGGTGGGCTCCAGCCATGCCTCCTGGAAGGCCATTTGGi GCCTGG 

CTGGCAGGCAGGAGCTGTGTGCTGCCAQGCGGGTCTAGATGTGGTTGGTTCCT 

GGCTGCACCTGCCGGGCTCATGACCTGCCCTGTCCTTTCCTTGGGGAGCTTGG 

TGACGGCTCAGGTCGACTCCCAGGGGCCCGCCCTTCAGGGCCTGCATGAGTG 

TTCTCGGGCTGGCTATGGGGGCATTGCTTCTTCCTGTCCTAGCCGGGACCTGC 
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CCATCCACCCAGGGGGCTCAGGGACGTTTTTGAGTTTTCCTGAGGCTTTGCCT 
AGGGAGTGTCGCTGAGGAAGCTGGAACAGTGACCTTCGTTGTTGGGGGAGGA 
GATGCAGGCAGGCTGTGGCCCAGCTGCCCCATTTTGTAGCTGGGGAAACTGA 
GGCCCAGAGTGGAGAAGCGATCTCCAAAGGGGAGGTGACACATGGCTGAGC 
5 CCCGCCTGATGGGTGCTTGCTGGGGGGAGAAGTCCTGGCTGGGACTTCCCCTC 
CCGAGGCTGGGGTTTGTCCCCTACTTTGCAGGAGGAGGGGGGCTCCCAGAAG 
AGAGTGAGTTTTGAGGGAACACAGTGCTGGGGGTCCCCTCCTGGGCCTGGGG 
CTGTGGTTCTTAGAGGGGCAAGGGGCTGGCCCCTGTTGCTGGCTGTGGCAGCC 
ACAGCCTTCTGACCCTTGCTGGCCTGTCGTGGAGTGGGGTTCAAGATGCTATC 
10 ACTGACATCTGCCCAGGCTCGGGGGCTTCTGTGTGGTGCCAGCCTTGCCCGGG • 
GGCACAGCCATCAGCAGGCCAGAAGTCCTCAGGAGAGCTTGCTGCCAGGCTG 
CTGCCTGGGGCCAGGCTTCAGATATGCTGGCCAGCCTTGTCCAGGGGGCCTCC 
TGCCAGGTGGGCAGAGGGAGAGGGCCTTCCTGTCCTCATGAGAACAGGGCTG 
AGCTCCATGTTCCTTAGCCTCCCTTGGCCTCAGTCATCCTATCTGGAAACAGG 
15 GATCATAATAGTTTTAGTTTATAGAGAGCGAGGGCCTGGCCCCTGCGCTCTTC 
CTCTGTGGTCTCCTCTGCCCGCTCCCAGAACACCTGGCCTCCTGCTCTGAGGG 
CCAGCCGTGGGTTGCAGCAGTGAGCATGACCTGGCCACAGTCTAGGGGCTTT 
GGGGTGGGTTTGTGGGCAGCTGGGCTGCAGGGGGCTGTGGGAGCCTGAGGGA 
GTGCCTTCCAGGGGAAGCCCGACAGATGGGGGCACTCCAGGCAGGGGCAGG 
20 AGGTGACCTGGAGGTCGGAGGAGCAAGCTGTGGCTGGAGAGCCGGGCAGGG 
TTGGTCAGCAGGTCCGAGGGCCTGGGCCTCCCTGGGGTGAGGGCCAGGGTGC 
CTCTGCAGGGGTGGAAGTGGGCTGTGAGGGGGTCTGGGTTTATCTCCCCATTG 
CACAGGGCTCCATGGGGTGGGCTGAGCCATGAGCGCCCTTCGTGGCACCTTG 
GTGTCCGGCAGGGAGGCAGGGGCCCAGGAGGAGGGCAGGACTGGCTTCAAC 
25 GGTGTCGGCCTCTACCCTTTTGGTCCTCACGGAGGAGGCTGCTCCCTCTTCCCT 
GGAGCAGCCAGGGATGGTGTGGGGCCCTGGTCCCCTGGCTTGATATAGGCTT 
CTGCAGAGGATATGGTGTGGCCGGGGGCTCAGCTGAGAAGCCACCAGGAGAC 
AGATACAGCCTCAGGAAGTGGTTTTGTATCTCTGCTGAGAACCACAATCCGA 
GTCAC/JrGGKjACCATTGAGATCACGTCAGGGTGCAGGCTGGGGAAGGCTGGG 
30 AGCTGGGGTTTGGCAAAGGCGTTGGGGTGCTCTTGTGCTGGACAGGCTGCTG 
CCCACCGGGCCTGGCCCAGGTGAGGTGAGAGACACCTGCCCCCCACGTCAAT 
GCCTGGGCACACGGGCCCAGTGCACACAGCCAGCAGGCGTGCGTCTTGCAAT 
GGGGGCTGTGGGGACCCCTTTGCTCCCCTGATCCCTTCCCTGCAGACCCAGGC 
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CAGGTGGATTCAAGGTGTGGGGCAGGGTAGGAGCTGGGCACTGCTGGATGGC 

ATGGGCCACCCGCACTGGGGCTGCTGTGACCTCCGAGTCCAGGCACTGCCTTC 

CACCTGGTTAGGGACAGGGCAGGGAGGCTGATGGGCTGGCCCAGCGTGGGG 

ACCTGCCCAGTGGCCCTGTGGGTGCAGGCCATGTGCCTCTGCATTTCCTTTAC 

CCCATCTGTAAAATAGGGATAACAGCAGCTTCTTAAAGGGCTGTTAGCCACG 

TGCTTAGACTGTGCCCGGAAAATGCACTGTCTCTCCTGGGCTCCCAACCTTTC 

ACAGGCCACAGGCCCATTCCCAGAAGTTTATTTTTTATTTTTCGAGACAGGGT 

CTCCCTCTGTCACTTAGGCTAAGAGTGCAGTGGTGTGATCTTGGTTCACTGCA 

GCCTCGACCTCCCTGGCTCAAGAGATCCTCCCGCCTCAGCCTCCTGAGTAGCT 

GGGACTACAGATGCACACCACCACGCCTGGCTAATTTTTGTATTTTTTGTAGA 

GATGGGGTCTTGCTATATTGCCCAGTCTAGTCTCAAGCTCCTGAGCTCAAGCG 

ATCTGCCTGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCATGCACTTTTTT 

TTTCTAAGTAGCTTTACGGAGGCGTGCTTGGCATACGGTAAACTGCCCCAGTT 

TAAAGCAGACAGTTCCATGAGTCCTGACATTTGGAGGCCCTGTGAAGCCAGC 

ATCAGCAGAGTGGGCACACCCATCACCCTCACAGCGTCCCCCTGTCCCCACG 

ACCAGGCCACTGCTGCCCTACCATCAGGCACGATGGATGGGCTTGTGCCTTGT 

GGAATTTTATTTGGATGGACTCTGGCAGCCAACGCTGGTTTCGTCTGGCTTCT 

TTTATTCAGCGTCATGATTTTGAGCTGTGTGTCTGTTGTGTGTGTCGGCCTGTT 

GTTTGGGTAGAGCCTACTTTGCCGATCCATGCACCTGTTGATGGACATGTGGG 

CGGCTGTATCCCATTTGGGCTCTCACAAATGATACTGCTAGGAATAGCTGGAG 

TCTCTGTGCGAGCAAATGCnTTTGCTTCTCTTGTGCAAGCAAGTGGTTGAACT 

GCTCTGCAGCCCGCTGGGAAGTGGAACGGCTGTGTCATGGGGTGGGTGAGGG 

GTTCACTTTCTAGGACACTGACAAACCGTTTTCTAAAGCAGCAATCCCCTTTT 

GCAGTCCAGCAGCTCCGTGCGACAGCTCTAGTTCCTCTGTCTCCTTGTCAACA 

CTTGGTGTGGTCGGTCTTTAATTTTAGCCAATCTAACAGGTGTGTCGGGGCCC 

TCCCTGGCATTCCTGGTGACTCACTGTGTGGGGTCATCTTTTCAGCTGCTTAGT 

TGTTGTCCACATATCTTCTTTGGTAAAGTGTCTGTCAAATTGTCTGTTAAGAAA 

ACGTGGGTGGTTTTTCATTGAATTCAAGAGTTTTTTTTTTTTTCT^ 

AGAGAATCTCACTCTCCGTCTV.XCAGGTTGTAGTGTGGTGGCGTGATCTTGCT 

TGGCTCACTGCAACCTCTGCCTCCCGGGTTCAAGTGATTCTCCTGCCTCAGCC 

TCCCAAGTAGTTGGGACTACAGATGCCCGCCACCATGACTGGCTAATATTTGT 

ATTTTAGTAGAGACAGGGTTTCACCACGTTGGCCAGGCTGGTCTCTAACTCCT 

GACCTCGTGATCCACCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCATG 
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AGCTACGGCTCCCGGCCGAGATGTTACATGTTTCTATATTCTGGACCAAGTCC 
TCTATCAGATACCTGCTCTGTGAAGCCTTCCCTTATCCCTTGCTAGTCCTTCAG 

TTCTTTTAAACAATTTTTTTTTTCTTTTTTTGAGACGGAGTCTC 
CAGGCTGGAGTGCAGTGGCGCGATCTCGGCTCACTGCAAGCTCCGCCTTCCG 

5 GGTTCATGCCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCGC 
CCGCCACTGTATCCAGCTGATTTTTTGTATTTTTATAGAGACAGGGTTTCACCG 
TGGTCTCCATCTCCTGACCTCGTGATCCGCCCGCCTCAGCCTCCCAAAGTGCT 
GGGATTACAGGCATGAGCCACCACGCCCGGCCCTTTTAAACAACTTGTTTTTA 
GAGCTGCGGTCTCACTCTGTCGCCCAGGTTGAAGTGCTTGGTGCTATGACGGC 

10 TCACTGCAGTTCCAACTCCTGGGCTCAAGCCATCCTCCCGTCTCAGCCTCCTA 
AGTAGCTGGGACCACAGGCGCATGTCACCACGCCCGGCTGATTGTTAAATTTT 
TTGTAGAGCTGGGGTCTGGGTATGTCGCTGACAATCAAAATGAGGACTTGAA 
GCATACTCCTCAAATCATCGAGGTTTATTGAGCCAACGTGAGGGCACGCCCA 
GGAAAAACTCCAGTCACAGAAGCATCTGTGGCTGCirrTTCCAAAGAGGTGC 

15 GTAGGAGGTTTCATTCATATTTATATGTTTTCTTGAAAAGGGGAGGGGCATCA 
AGTGAGACAAATGATTATATACTTGCGAGACTTTAGTTAGAGCCCAGTAAAT 
CTACATTTTACAGAAGACGTATATTGGAGGAAAGAGGGAGCATGGAAGCGTA 
TATCCCAGGGAGGCGGAAGAACATCTCGTTATCTCGTCTCGCCTTTGTTCTGT 
GCCTGGGAAGGAAGACAAGCCGGCGTCTTATGAAAAGGCTGGTTTCTGTTCA 

20 GCCCTTTGGGAAGAAAACCCAATGACAGTTATCAAGGGAGGGGGTGTGATGA 
TACGTATCCAACCGCACATCCATCATGGCCCTGAACTCAGCTTCCAGGGTTTC . 
TCTGGGGTTCCCTCGGCCAAGTGGGGATCCACTCAGTTAGGTGGGGGGGGGG 
GTGCTTAGAATTTTATTTTTATTTCTCATTGCCCAGACTGGTCTCAAAACCCTG 
GTCTCGATC AATCCTCCCACCTC AGCCTCCC ATTCTCTTAATAGTGCCCTTTTA 

25 GAGAGCAAACGTTTTGACTTTGATGAAGTCCAGTTTATCAACTTGTTCCCTTA 
TGGATTGTGTTTTTTGGTGCTCTCAGAAATCTTTCCCTAAACCAAGGGCACAG 
ATTTTCTGTTTTCnTCCAGAAATTTTGTAGTGTCAGGTTTCACACTGAGGTGGA 
TATTCCTTTTGAGTTAATGTCTGACCAGCTGGGAGGTGTCGGTGGAGGCTCCC 
TTTCTGCCCGCGGGTTCCAGCACATTTGCTGGGA7|tGCCCTCCTCCCTCCACA 

30 GCAGAGTTCTACCCGCCCTCACCACCCTGTATCAACTCATCTCATCGATATCT 

cccaacggctggggaacaagggctgactcggctgcaatcatcctgcagccag 
- actcggggacctaggactctgaacctccaactggtgtggcttcagggtcccca 
acttttcagaccccacctctgtgttgctctgcaagtggggatcagcagggctg 
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TACCCCAAATGCTGCTTGCTTCTCCCGCCCTCCTCCCTCTCCCCATCTGCTCTT 

GGCTGCACCCCCAACCCTGACCCAAGTGTCCTTCACGCTCAGCAACGCCACA 

GCCTCTGACACTCGACGGCACTAAAGTGTCAAGAGACCCAGGGGACCTGCGA 

GATCCATCTCGCCCCACACACCACCAGGTCCCCAGGACTTGGAGGATGCTGA 

TGGCCCAGCTCTGGGTCCTAAGCATGGCAGTATCACAGCCTCACAGGCACAT 

GTGTATGTGTGTGTGTGCACGTGTGTATGGGTGCGTGTGCACCTGTGTATATG 

GGTGTGTGCACGTATGCACGGATGTGTGTACACATGTGTATGGGTGTGTGCAT 

GGGTGTGCACATGTGTATGGGTGTGCATGTGTGTATAGGTGTGTGTATGTGTG 

CATGGGCGTGTGTATGTGTGCATGGGCGTGTGTGTACTTGTGTATGGGTGTGT 

GCATGTGATTGGGTGTGTGCATGTGTATATAGGTGTGTGTGCACGTGTATGGG 

TGCGTGTGCACCTATGTGTGTATAGGTGTGTGTGTTTATGTATGGGTGTGTGT 

GCACATGTGTATAGGTGTGTGTATGCGTGCACCTATGTGTGTATAGGCGTGTA 

TGTGTGCGCGTGTGTAGGTGTGTGTTCATATATATGGGTGTGTGTGCACCTAT 

GTGTATAAGTGTGTGTGCACATGTGTATAGGTGTATGTATGGTTGTGTGTGTA 

TAGGTATGTATGTGTGCACATGTGTGTAGGTGTGTGTGTTCATGTATATATGG 

GTGTGTGTGCACCTATGTGTATAAGTGTATGTGCACATGCGTATAGGTGTGTG 

TGTTTTATGTATGGGTGTGTGTGCACCTTTGTGTGTATATGTGCATGTCTATGT 

GTGCACATGTGTGTAGGTGTGTGTGTTCATGTATATATGGGTGTGTGTGCACC 

CATGTGTATAAGTGTATGTGCACATGTGTATAGGTGTGTGTTTTATGTGTGGG 

TGTGTGTGCACCTTTGTGTGTATACGTGCATGTCTGTGTGTGTTTGTGGGTGTG 

TGTGTACCTATGtCTGTATAGGTGTGTGTGTTTGTGTCTGGGTGTGTGTGCACC 

TATGGGTGTGTGTGTGTATGCATGTGTGCATGTCCCTCACAGGGCAGAGCTGG 

AAAGAACTGGAGAGGCTGAGGCTGGTCCCTTCATCTCAGGGTGAAGCACCTT 

TACCCAGAGAAGGAGGGTCGTGGCCTGGGCTACTTTGTGGGCTGGAGTCCTG 

GTCTCTGTCACTGAAAAAAGCAAGGCAGAGCGTCTGGGAGGGGTGTCATGCG 

GGGCTTTCTGAGTCACTGGCTGCGTGAGATGTGAGTAGGTCTGTGGGATAGG 

GAGCTGGCCAGGGACTTGAGCTCCCTGCCTGCTATGACCTCAGTCAGCCCCTG 

CCCATTGTCAGGCTCCTTCCCTGTCTAGGGGTACCCAGTAGAGCTGCCCAGCC 

CTGCCACTCCAGCTTGTTTGAGGAAAAAGTGCAACCATGCTGTGGGTGG^jG 

CAGATGTGATTCTGTCGTCAGAGGCATGGGGGTGACTCCATGGCTCTGAGCCA 

AGTTGCAGCTCTGGGGCCCTTCTTGATACTCCAGGGCCCCCCATCACCCCAGC 

ACCGAGATGGGCATGAGCAGAGGGGCTCATGGTGATGGGGTGGGCGTGGCTG 

GGAGAAGCCTGCATATGAAGGCCTCTGTCATTGGAGGAGACTGTCCCATGTC 



WO 02/054069 



204 



PCT/US01/49808 



CCTCCAGATCATGTCCCTGAGTGGGGAGCGCCTTCTGGGCAGAGTGGCACAC 
AGGGTCCCACCCATCTCAAGTCCATGGGATTAACTCCCCTGTGCTGGAAGCTG 
CAGGAGCTGCTGCCTGTGGCTGAGGAGAGGCCAGGACGGGCCAAGGAGAGC 
AGGCAAGAGAAGGGGTGGGGCTGGCAGGGAGGGGGCTCCATTAATTCTGAT 

5 GTCATTTATTGCTGTTCCTATATTTGTATAATAAATGGAAGAAAATAGTGGTT 
GGGGGCTTGCTGGGGACTCAGGGAGGGTTGGGGACCTCCAGGGCTTCCATTC 
GACCTGAGGCCCATGGTGCAGTGAGTCCTGCCTGGAGTCCCGTCCAGCAGGA 
GGCAGGAGAGCTGGTGTTTGGAGATGTGAGGGGCTGGGCCCTCCCAGGTGAG 
CGGGTTGGGAAAGGAAAGGGTCTGTGCTGGGAAATCCAAGGGGCAAGGCCG 

10 TTTCCCGTCCTGGTCCGTCGTTGAGTCCCCTGCCGTGGGACAGCGCTCAGCAT 
GGCACAGACGCTCAGCCGTGTCATGTGGCTTGTAGAGCGAGGGTGTGCGGCC 

ggcatcatcctcagggaactctggagggaCtgggcctcagttggcctggcct 
ggcgcggggctgccagccaagagccttcctccatcccattgagagccagggc 

AGGTCCTGGAGATGGCAGCCTGCAGGTTGAAGCCCCCGACAGCTGGGGGAGG 
15 CATTGCTCTGCACAGGTAGGGCCATGGGGCCCGCTGGCCAGGGTGGCCATTG 
CTCAGTGGCACCAGCTGGGCAGACAGCCGCATGCGCTCACCACTCGCCCTCC 
ACAGATGGGAGCCTGGGAGAGCCCTGCCTCCCGGAACGGACATCCTCCTTCA 
CCAAATCCGCAAAATAGGCCTGATGCCAGGAACGGGGCACTGCAGGGCCCTT 
GGGAGGGAGGTAAGCGGGCAGCCGGGCCAACCTGCTGCTTGGGGAAGGGCT 
20 GTGGCTGGCCCCAGCCTCCTACCGCTGGCCCAGTGGTCTCTCCACACATCACC 
CAGTGTGGAGTGGTTTTTTGTCTTTCTTTCGAAACAGGCTTTCCTCACTGAGGG 
CTCACCATGGGGCTGTGCTGGGGACACTGGATGTGTCCAGGGTCCGTCAAGA 
AGCGTTGGGGGTAGTCAGGGTCTCAGGGACCACTCTGCTGTCCTCGAGTCTCC 
TGGTGGTGACCAGTGGTGACTCTTGTGGCGCCCGGGCCACAGCTACTTTGTTC 
25 ACCTCCTAGGAGGGAAGAGGGCGAGGGTGAGCCTAGGGCAGATGCCAGGGC 
AGTGAGGCTCACGTGGGTCTCTTTGCCTGATGTGCCGGTGTCCAGGCTGCAGT 
GGGCACCCGAGCCCTGTGGGAAGCCTCCTGGGTCTGCAGCTGAGCAAAACAG 
CAGCCGCCAGATGATGGGCCCTAAGTGGCTGCAGATAGCAGGCACCATTAAA 
TGCAGAGGCCA.-'CTGAGGCTGCAGGGTCAGCACAGGTGGCTTTGGTGCCCTT 

/I- 

30 AGGGAGGGGCAGCGGTGGAGCCCAGGTCTCTGAAACATTTATGACTTCATGC 
GTCAGCTGGAGGCTCTTTCACCGCCCAGACAAAAGGCCCTTGCGTCAGAGTG 
TTCCATGTGGGGCGGGAGCTTGGGCTGAGAGCTTGGGCCCCTGCTCCCAGAG 
AGGCCCAGAGCTGGGGTGGCCACACCTGCTGGGAATGCTGCTGGTGGGACTA 
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GGTTAAGGCGAGAAGGAGGTGTCCATGGCCAGGTGGATTAGTGCAGATTGTG 

ATGGTGTGGTGGTGCTCACGGTGGGGGACTTGCCTATGGGAACCTCATCACA 

GCCCCACATCGGGCCCTTTTACAGATGGGAAAACTGAGGCATGGGGTGCATA 

GCTAGTGAGGAGCAGAGCCAGATGGGAACGAGGCGTCGACTTCAGAGCTGAC 

CTTTGTGCATTTGTGCATTGTTTGTGTTGTGATAAAATACACGTAACGTAAAA 

TTGACCTTTTTAACCCATGTTTCATGGTCTTCGGTTTACTCGCAGTGCTAGGCG 

TCCGACACGTCTTTCTAATTCCTGAACATTTCCATCATCCGCAAGAGGAAATT 

AGCAGTCACCCCTTTCCCCCATGCCCCCAGCTCCTGACAACCACAAGCCCACT 

TTCTGCCCCTGGATTTAGCTATTCTGGATGTTTCATGTGCCTGAAATCATGGCA 

AACGTGGCCCTGTGTGTCTGGTTTCGTTCACCGAGCGTCCTGTCTTCACTGTCC 

GCCGTGTTATAACACGCAAGGGAATTCCCTTCCTTTCTGAGCTGAGTAGTGGT 

CTGTGGTGTGGATGGACTACGTGCAGCTGTCTCTGCCTTTGGCTGTTGTGGCT 

AGAGGCTCCTTCCTGAAAGCCTGATACTTAGAATGGATTTAAAAACAAATTC 

AAATGTGTGTTGCTTATGAAAATCACAAAATTAAGTATGATGCCATCAAAAT 

GAAAAAATGGCTAAAGATTTTCAAGCACACACACCTAAAGATAGCAGACATr 

CCATTCAACCTTCATCTCAAAGAATTAGGGCAAGGAGGGCGATTCTAAATGG 

CTGGAGACATAGTCAGCTGTGAAGCTGTAACTTTCATGAAGACAGTGACGTG 

GTTTGTCACTGCCCGGATCTCATCTTGAATTCCCTGTGTTGTGGGAGGGACCT 

GGTGGGGGGTAATTGAATCACGGGGACGAGTCTTTCCTGTGCTGTTCTCGTAA 

TAGTGAATACGTCTCATAAGATCTGATGGTTTTAAAAAGAGGCATTCCGCTAC 

ACAAGCTCTCTCATGTTTTGCCTGCCACCATCCACGTAAGATGGGACTTGGTC 

CTCCTTGCCTTCCGGCATGATTGTGAGGCTTCCCCAGCCACATGGAACTGTAA 

GTCCAATTAAACCTCTTTCTTTTGTACATTTGCTCAGTCTTGGGTATGTCTTTA 

TCAGCAGTGTGAAAACGGACTAATACTGACAATAAATTAAAAACTATTTAAA 

ATGCAAAGGAAAATATAAATTTAAATCACCTCTTCGAGTCTCTGGTAGATCAA 

GAAAACAAAAATAATCTGAATTATTGATATAGTTAAATTAATAGATATATCA 

AACTTTGTAACTTAAAAGCAAAAGATGTACCTTTTACAGTGTTCCTGGAACAC 

TCACAAACATTGATCATATACTAGACTGCAAAAACTTCAAGACATTCTAAAAT 

TAAGAACCTATGCTGGATATCiCTGATTTCAACGTATGTTTAAAATATATGTT 

ATTAATTACAAGTTTAATTAATGAAAGTCTAAAGAAGAAAAGTCTACTTAGA 

AGTAAAAACATCAGCAATATTACTGTGTAATAGATCTGAACATTAAAGGGGC 

ATATATGCATATCTGTACACCGAGGCAAATTTATAGCTACAGGCATTTTCCTT 

TTTTGAAAAAAGTAAATATGTAAGTTAAAAAGTTCACAGTAAGAAAAGTGAA 
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TGTAAATAAGAGATAAGAATAGAAAATAATAGGAAACAGGAAAACAGACTT 
GAGGGTTCTTTTTGGGAGGTGATTCTTGGTTCTTTGAGGAAAACAATATATAT 
CTTATAATTTCAATCATGGAAAAACCCCAAGTATATATCTAAATTTAGACATG 
AGAAGGCTCTTTGGCAATCGATTTAGAAAAGTTTTGTTTGTGTTTTTAAGACA 

5 GAGTCTCACTCAATCACCCAGGCTGGAGTGCAGTGGTGCAATCTTGGCTTACA 
GCAACGTCTACCTCCCGGGTTCAAGCGCTCCTCCCCCCTCAGCCTCCCAAGTA 
GCTGGGTTTACGGGCATGTGCCACCACCCCTGGCTAACTTTTTGTATTTTTAGT 
AGAGACAAAGGTTTCATTATGTTGGCCAGGCTGGTATTGAACTCCTGAGCAC 
. AAGTGATCCACCCACCTCGGCCTCCCAAAGTGCTGGGATCACAGGCATGAGC 

10 CACTGCACCCAGCCCAGTTTTTAATGTGACAGTACITrGTAAAATTGTACTCC 
AGTGAATTTGAAAAATGTTGTGAGATGGCTTATTCTTTGGGAAAAAATGTTAA 
GTCAAAATTAGGAAGAAGTGGAGACAATCTCAGTATGTGTGAGATGAATCTG 
AACCCTCATCAGAACGTCCACAGCTGACGGTGAACACATTGGCAGGTGAGAT 
ATGTGGAGACATGGCACGTGGTGAAAATGCTTGGCAAAGTGTTCAAGGCAGG 

1 5 ATGGAGATACCGTATGGGCATTCACTGGACAGTGTTTTCATCCCGTATGTTTG 
AACAATTTTTAATGTGGGATCAAAGGTAGGAAATGCTCTTTACAAAGTCATCA 

CATTGAATTAGGATGCCGTCTGCTACTGAGAGAGCCAGTCATGTTAGTGGCTC 
AATGGGCAGGGTGCCCAGGTTAGCTGTGAACAAAGTGATCTCTGTGCTGCCTC 

20 ACTAGCAGAGAGAAAAGAAAACTACAGACCAGTGATATGTACACAGAGAGC 
AATGAAAATCCTCATTATGATGGCAGACTAAACCCATAGTGTGTTAAAAATG 
ACCACACATGTAGGCTAGGCACAGTAGTTCATGCCTGTAATCCCACCAGTTTG 
GGAGGCTAAGGTGGGAGGATTGCCTGAGCTCAGGAGACCAGCCTGGGTGACA 
TAGTGAGACCCCATCTCTTTTTATTAAAAAAAAAAAAAAGACCATACATACA 

25 CACATGCACACACAACAGTGGTAATGGATTATAATTCATGAAAAAAGAATCT 
ATGCATCCACAGGGACACACCCATGTATATAAACAAATAAGTGGAAGAAAGG 
AATCCAGTTCTCACGTTAGAATCAACAGGGCTGGAAGGCCATCAAGGAAACC 
CCGTGGCAGGGGCAGGGGTTCAAAGAGCTGGTGGTTGTAGTCTCAGTGTCTC 
CCACCACCACTTACTAGTTACAAAGGAAAAAGAATCi\CTTTGTAGAGGCAAA 

30 GCTGGCGACACCACCCGAGCCAAGTGATCCAAGTTAACATCACCAGGAATGG 
GACAGGCCAGCTTCACCACCCTCCGGATGGACAGCCTCGGTAGCAGAACTCC 
TGCCAAAGACACAGCTGTCCAGGGTCGAATCATCCGGAGACACAGGGCAAAC 
CCACATGGAGGGAGACACACACAGTGGCCTGAATACTTTGAAAATGCCAGGG 
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CCAAAAAACGTAGAAGGCTAAATAAAGTACCGTTCAGGGTGAAGGGGAAGG 

AAGAAATGTGATGCCTAAGTGGGAGTCTATGATTCCACTATGTTATGTATTAG 

TCCATTTTCAAATTACTGATAAAGACATACCTGAGACTGGATAATTTATAAAG 

GGAAGAGGTTTAATTGACTCACAGTTCCACATGGCTGGGGAGGCCTCACAAT 

CACAGCAGAAGATGAAGGAAGAACAAAGGAACATCTTACATGATGACAGGC 

AAAGAGATAATGAAAACTGAGCACAAAGGGTTTCCCCTTATAAAACCATCAG 

ATCTCGTGAGACTTATTCATTACCATGAGAACAGCATGGAGGAAACCACCCC 

CATGATTCAATTATCTCCCACGAGGTCCCTCCCATGACACGTGGGAATTATGG 

GAGCTACAATTCACGATGAGATTTGGGTGGGGACACAGCCAAACCATATCAT 

TCCACCCCGGCCCCTCCCTAATCTCATGTCCTCACATTTCAAAACCAATCATG 

CCTTCCCAACGGTGCCCCAAAGTCTTAACTCATTTCAGCATTAACTCAAAAAT 

CTGCAGTCCAAAGTCTCATCTGAGACCAGGCAAGTCCCTTCCACCTATGAGCC 

TGTAAAATCAAAAGCAAGTTAGTTAATTCCTAGACACAATTAGGGTACAGGC 

CTTGGATAAATACACCCATTTGAAATGGGAGTAATTGGCCAAAACGAAGGGG 

CTAAAGGCCCCATGCAAGTTCGAAATCCAGTGGGGCAGTTAAATCTTAATGC 

TCCAAAATGATCTCCATTGACTCCATGTCTCACATCTAGGTCATGCTGATGCA 

AGAGGTGGGTTCCCATAGTCTTGAGCAGCTCTGCCCCTCTGGCTTTGCAGGGT 

ACAGCCTCCCTCCTGGCTGCTTTCACAGGCTGGCATTGAGTGTCTGCAGCTTT 

TCCAGGCACATGGTGCAAGCTGTTGATGGATCTATCATTCTGGGATCTGGAGG 

ACAGTGGCCCTCTTCTCACAGCTCCACTAGGCAGTGCCCCAGTGGGGACTTTG 

CATGGGGGCTCCAACCCCACTTTTCCCTTCCTCACTGCCCTAGCAGAGGTTCT 

CCATGAGGACCCCACCCCTGCAGCAAACTTCTGCCTGGACATCCAGGCATTTC 

CATACATCTCTGAAATCTAGGCAGAGCTTTCCATACCTCAATTCTTGACTTCT 

GTGCACCCACAGGCTCAACACCACATGGAAGCTACCAAGAGTTGGGGCTTGC 

ACCCTCTGAAGCCACAGCCCAAGCTGTACCTTGACACCTTTTAGCTATGGCTA 

GAGCGGCTGGGACACAGGGCACCAAGTCCCTAGGCTGCACACAGCATGGAG 

GCCCTGGGCCCAGTCCAGGAAACCATTTTTGCCTCCTAGGCCTCTGGGCCTGG' 

GATGGGAGGGGCTGCTCCTAAGGTCTCTGACATGCCCTGGAGACATTTTCCCC 

ATTGTCTTAGTGATTAACATTTGGOTCTT.GTTACrTATGCAAATTTCT 

CTGCTTGACTTTCTCCTCAGAAAATGGGTTTTTATTTTCTATCACATTTTCAGA 

CTGCAAATTTTCTGAACTTTTATCTCTGTTTCCCTTTCAAAACTGAATGCTTT^ 

AACAACACCCAAGTCACCTCTTGAAAGCTTTGCAGCTTAGAAATTTCTTCCAC 

CAAATACCCTAAATCATCTCCCTCAAGTTCAAAGTTCCACAAATCTCTAGGGC 
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AGTGGTAAAATACTGCCAGTCTCTTTGCTAAAACATAGCAAGAGTCATCTTTA 
ATCCACTTCCCAAAAAGTTTCTCATCTCCATCTGAGACCACCTCAGCCTGGAT 
TTCATTGTCCATATCATTATCAGAATTTTGGTCAAGCCCTTCAACAAGTTTCTA 
GGAAGTTCCAGACTCTCCCACATTTTCTTGTCATOTCTGAGCCCTCCAAACTG 
5 TCCAACCTCTGCCTCTTACCCAGTTCCAAAGTTGCTTCCACATTTTTGGGTATC 
TTTACAGCCGCACCCCACTCCCAGTACCAATTTACTGTATTAGTCCATTTTCAT 
GCTGCTGATAAAGACATACCTGAGACTGGGTAATTTATAAAGGGAAGAGGTT 
TAATTGACTCACAGTTCCACATGGCTGGGGAGGTCTCACAATCATGGCGGAA 
GATGAAGAAAGAGCAAAGGCACATCTTACGTGGCAGCTGGCAAAGAGAGAA 
10 TGAAAACAGAGTGCAAGGGGTTCCCCCTTATAAAACCATCAGATCTCATGAG 
ACTTAATTCACTACCACGAGAACAATATGGGAGAAACTGCCCCCATGGTTCA 
ATTATCTCCCACTGGGTCCCTCCCACAACACATGGGAATTATGGGAGCTACAG 
TTCAAGATGAGATTTGGGTGGGGACACAGCCAAACCATATCAGAGTTGAATC 
CTGTCTGGGAAGAGATGGCTGCAATAGCCAGTGTCTGGGCAGCTGATGATGT 
1 5 GAGAGTGTGAGGT AGAGAACTGTAGTGTGCAATAGAGACCTGTAGTGTGCAT 
GTCAGTGTTGTTTCCCGATTTTGACAACAGTACCACATGGTTACAGAAGAGAA 
CACCCTTATTCCTAAGAAATCCATGAGGAGGCGTTTAGCAGAAAACAGGCAT 
GGTGTCTCCAAGGGCTCAGAAAGATTCAACCAAAAGAGGGAAGTCTGGAGA 
GGGACAGGATGGAGCAGGACGGGGGCAGTGTAGAAACCGCAGGCAAGTCTA 
20 AGCAGAGCACGTTTGGGAGTTCCTTCTGCTATTCTTGCAACTTCTCTATAAGTT 
TGAAATGAAATCATATAAATCGAAAGTTAGAAGAGAATTACCACATCATGTT 
GGATTTCACTCATAGCCACGAGAACGGGTGGTGGTCTCTTCTGCAGTGAACTG 
ATGCGATGTGTGATGTCACTAGTTCTCATCCAAAGCAGATGACAATTCCAGTT 
ACrCCTCACCAGACATAAGAAAATGCTTTTAGTAGATCAGTTTTTCTGATTGT 
25 AAAGTTCAGAAATGTTAATCAGAGAACGTGCAGAAATTTTTAAAGAAGAAAA 
GTAAAATGACTCATCGTACCAAAAGCCAGAGATGCCAGCTTTTAGTTTCCTGC 
TAGTCTTTTTCTCGGCGTGTGTGTGTGCGCGCGTGCATGCACATGTTTCTACTT 
GGTCTTTGTGTGTGTGTTAAGGGCGTGTACACACACAGTGCAGTCAGGAGTG 
AGTTGCACGTCl^VCAGCTAGGCTGCTTCCCCGAGGCTGTGTGGCCATCAGCCA 
30 GGCGTCCCCAACCCCCAGGCCATGAAGCACTACCAGTCCATGGCCTGTTAGG 
AACCAGGCTGCACAGCAGGAGGTGAGTGGCAGGTGAGCAAGTGAAGCTTCAT 
CTGTATTTACAGTCACTCCCCATCACTCGCATTACCGCGTGAGCTCTGCCTCCT 
ATCAGATCAGTGGCGGCATTAGATTCTCACAGGGGTGTGTACCCTGTTGCAAA 
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CTATGCATGTGAGGGATCTAGGTTGCCTCCTCTTTATAAGAATCTAATGCCTG 

ATGATCTGTCACTGTCTCCCATCACCCCCAGATGGGACTGTCTAGTTGCAGGA 

AAACAAGCTCAGGGCTCTCACTGATTCTACATTATGGTGAGTTGTATAATTAT 

TTAATTCTATATTACAATGTAATAATAGTAGAAATAAAGTGCCCAATGCAAGT 

AATGTGCTTGAATCATCCCAGAACCACCCACACCACTGGTCTGTGGAAAAAC 

TGTTTTCCATGAAACCAGTCACTGGTCCCAAAAAGGTTGGGGACCACGGCCA 

TCGGCTGTTTGCTTTATGAATTTGTGCCTCAGTTTCTTCATCTCTGAAACGGGA 

ACGATGGGAGCCCTCACCTCGTATGGTGGTTTGAGGATTGAATGAACTCATTG 

TCCAGCACCCTTAGCTTCTTGCCAGCTCAGGAAGTGCTGAGCAGGGTAACCCC 

TATCACTGCGTGTGTGTGTGTGTATATGTGTGTGTGTGCATACTTGCCTATGTG 

TGTCTGTGCAAATACTTGGTAAAATTGAGAGGGTACTAGACAACTCATTTGTA 

TCTTCTTACCTTTTGACACATGGCAGACATGTCTCACATCTGTTCTTCCTCCAA 

ATGTGACCTTTCAATGGCTGCCTGGCATTTCCCCTGTGTTATTGAACGTAATC 

AACTCTCTATTGACGGCTGGGTTATTTCCTAATTGACTGCTGTAAATAGCATT 

GTGATGCAAATCCTTGTACACTCATTTTTGCCTGGAACTCAAGATTATTTCCTT 

GGGTCACATTTGCAGGAGAAGACTGACAGGATCAAAGGGTTCCTGCATTTTT 

ATGGCTCTGACACCTTTGGCCAAATCACCTTCCGGGAGTGTTTTCCTAATTTA 

GCTTCCTACTGTGCTGCTGGACATCACTGACTCCCCCCGCCAACCCCCACCCC 

ACACTGCACCACAAAGCTGGGTATTATCATAAAAACACTGCCATTATGATGA 

AGCAAAAATAGAATATCCTTCCTCCTCTTAGTGAGGGGAGTATGTTTATGTTT 

GTTGAGCTCTTTGTATTTTAGCCACCCATTCACTGGCCCACCTGCCCCTGGTGC 

ACCTGCAGCAGCAAGACTGTCATCTAGGTTGTTGGGGAAACCAGGAGTGAGT 

GGAGCACGAGGGTCACCCTCCAGGAAGTGACCTGCAAGAGAATGGAGAGAG 

TCATGGACCGGCATGGCGTGGTAATGCATGGTGCGTGTCTCTTAGCAGTAAG 

ACCTTGAGAAGGAGCCAGGCAGGGAGGAATGGGAGGGTGAAGGAGTGCCTT 

TTGTGGAGGGGAGTCGGACACAGATGGGATTTCAGCAGAGACCTGATGGAAT 

TGAGGGAGTGGGTGGTGTGTGCTTCTGCAGGAAGAGCACCCAGACAGAGGGC 

ATGGCACATGCAAAGGTCCTGTGTTGGGAACGTGCTTGGTGCATGTGAAGAA 

CTGTGCAGGGAAGGTGCTGGAGATAG^ACGGGAGGGCCACGAGGGTTAAGA 

TCCTGAGAGCAATGCGGGGGGTGTGAGTGATGAGGGACAACATCAGTCCTCT 

CTTGGCTTCTGTACTTTAAAAAAATCATTTTGTTTTGAAGCAATm 

CAGAAAAGTTCCACGACAGTACTGAGAGTTCCTGTGGACTCCTAGTATTAAC 

ATCTTAATAACGGGGTGTATTGACCAAAACCGACACCTGATACATTACCGTG 
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AACTCAACTGCAGATCTGATTCAAATCCCGCGTTTTTCTAGCATCCGGTTTCT 
GTTCTGTGATCCCCACGTTGCTTTCAGCTGCCAGGTCTCCTTTCGTGTCCTGCA 
GCCTGTGCCAGCTCCTCAGTCTCTCTTTGTCTTTCGTGACCTTGATGTTTTAGA 
TAAGTACTGGGCGGCTGTTTTGTAGAACGTCCCTCGGTTTGGATTCTCTGATA 

5 TGCTCTCATGATTAGATCAAGATTGTGCATTTTTGGCAAGAACACCGCGGGAG 
TGTTGTTGGGTCCCGAGGAGTCCCTGTGGCAGGGCTGTGTGAGGTCAGCATGT 
CTTAGGGCGGGTTCATTTGACCTTGACCACTCAGGTAGGAGTGTGTCAGCTGG 
GCTTCTCAGCTGTAAATTAACCATTTTCCGCTTCATAATTAGTAAACACCTTG 
AGGAAGCTACATTGAGACATTGCAGCTGCCTTTTTTCTCCTGAGATTTTG^^ 

10 CGCTGATTTTAGTGCCCATCAGCTGGTTTTGCTTGTGACAGGCAAAGTCTGTG 
. GTGCTTGCCAAATGGGGATTTGTTTATTTCCCACTTTCTTTCTATGTTTGTTAA 
TTTGGATTCTTCTCCAGGAAGAGCTGTCCCCTTTGTTCTGGTTATGTATCCAGT 
GGCTTATGTATATCAGCATGGTTTCATAGATACTTCTTTTGtGTTATGGGCTAT 
AAGCCAATGCTATTATTATTTATTTTCCTCCATTTATTTATTTATTTAGAGAC 

15 GAGTTTTACTCTTGCACAGGCTGGAGTGCAGTGGCATGATCTCGGCTCACTGA 
AACCTCTGCCTCCGGGGTTCAAGTGATTCTCCTGCCTCAGCCTCCTGAGTAGC 
TGGGATTACAGGCACGTACCACCACGCCCAACTAATATTTTGTATTTTTAGTG 
GAGACGGGGTTTCATCATGTTGGCCAGGCTGGTCTTGAACTCCTAACCTCAGG 
TGATCCACCCGCCTTGGCCTCCCAAAGTGCAGGGATTACAGGTGTGAGCCACT 

20 GCACCCAGCCTCATTATTTATTTTCATTGCTCAAATTGTGCTGGCTTTTGCCCC 
AGAAACAACCTTCCATTCAGCTTCCATGCCCCTTTGGTGGGCCCTGTCTTTTCT 
TGGACACTGCCTTCCTGGCACCCCAAGATGCTCTGGGCTCAGCTTGTATTTCC 
TTGCCTCAGTCTTGAAATCACCCACTTCTCCAAGGAGCCAGGGTTCCTTCCAT 
TGGAGAGTGGTCTTTAGAAACCCAGGCCTGGGTGCATGGTGTGCTCATCGCC 

25 ACTGTGGACCTGCTGCTCTGCGCCCACTCGGCCGACGGAGCTGGGGAATATG 
TGTATGGCACCCCCACGTCTACGCTTCTGTGTTCATCTGCAGATATATTCATAC 
ATGAGTGGGCTCATACTGTCTCCAACTCCAGTTCTGCCCCAGTGGGGCCAATT 
CTCTCTCTTTGTTGTAACACCTTTCTTCAATAGTGAGAAACCTGGCTCTCTCAC 
CTTCACCTGCTCGTCCAACCCCAGCTGCAAGGGAACTGGCTC^ATACTTGCTG 

30 ATGCGTGCCCCCATGAGAAGTGACTGACCACGCAGAGCCTGGCGCCTGTGCA 
TGGCGTCTGTGCACGGCGCGTCTTGTCTTCAGCCTCACTGCGTCCCATCAGTG 
CTTTTCCCCACGGTGACTTTGAGTATTTCTTTACTTCTGTACTCCCTTCAGTGT 
GATTATGTCATTCATCTGTGACATGTTTAGATTTATCTGTTAGGTTGGTATTCC 
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ATTTTGGGTGTCCCCCGATTCTGATTGTTTTTAGTTACTTATCTGGGGGTCTGT 
GTGAAGGGAATGTAAAGCTGCTGTGACTCCTGGAGTCAGAGGTGCACAAAGA 
GTGTCCTCAGAGGAGAGCCAGGCGCCCACCCTGCCAGCCCTTCCCTCCCCACT 
CCATCCCCCACCCCCCCACCCTGGACAAAACCAGCTCTTCAGCCTTTCATGTC 
5 TCTCTTGCAGGCCTCTCGCACAGATGAGAAAATACACGTGTGTTTTCCTGTAT 
CCCCTCTTTCTTACAAAAAGGGAAGCACACTATCAATACTCTTGTAGATCTTT 
AATTGCGGTAAGATACACTAATGCAAAATTTACCATTTTAACTATTTTTAAGC 
ACACAGTTCAGTGGCATGAAGTACATTGATTTTGTTATGCAACCATCACCACC 
ATCCATTTCTAGAATTTTCTCATCCTCCCAAACTGAAACTCCATCCCCATTAAA 
10 CACCAACTCCCCCATTCTCTCTTCCCCAGCCCCTGGCACCTACCATTTACCTTT 
CGTCCCTATGGATTTGACTAATCCAGGAACTTCATAAAAGTGGGATCATTGAG 
TACTCTATTTGTCTTTTGTGACTGGCTTATTTCACTTGGGCATAATGTCCTGAA 
AGTTCACTCATGCTGTAGCATGTGTCAGAATTTCCTTTTCCTTTTGAAGGCTGA 
ATAATATTCCACTGTATGGATAGACCACATTTTGCTTATCCATTCATCTTTTGA 
15 TGAACATTTGGGTTGCCTCCATGTTTTAGCTATTGTGAATAATGCTACTATGA 
ATATGGGTGTACAAATATCTTTTCAACGCCCTGCTTTCAATTCTTTTGGGTATA 
TACCCAGAAGTGGAATTTCTGGATTATATGGTAATTATAGTTTTAATTTCTTG 
AGGAACCACTATGCCGTTTTCCACAGTAGCTACCACATCCTCACCAACACTTG 
TAATTTCTGGGGTTTTTTAATAGTAGCCATTCTAGTGGGTGTGCAGTGGTATCT 
20 CATTATAGTTTTGATTTACATTTTCTTATGAGTAGTGATGTTGAGTATCnrTTTC 
ATGCTCTTATTAGCCATTTGTATATCTTCTTTGGAGAATATCTCCTCAGCTCTT 
TTGCCCACTTCATTAAATTGAGAGGTTTGTTTTTGCTGTTGAGTTTTAGAACTT 
CTCTATATATTCTGGATATTAATCCCTTATCAGATATAGGATTTGCAAATATTT 
TCTCTCATTCTGTGGGGTGCCTTTTTACTCCGTTGATAGTGTCTTTGTACAAAA 
25 ATTTTTAATGTTCACGAAGTCCAAGTTGTCTACTTTTTGTTTTGTTGCCTGTGC 
CTTTGGTCTTTGTGTTAGTCCATTTAGCATTGCTATAAAGGAATACCTGAGGC 
TGGTAATTTATAAAGAAAAGAGGTTTAATTGGCTTATGGTTCTGCAGGCTGTA 
CAAGAAACATGGTGCCAGCACCTGCTTCTGGCGAGAGCCTCAGGAAGCTTCC 
AA-CACGGCAGAAGGTGAAGCGGGAGTAGGTGTCTCATATGATGAGACAGG 
30 GAGCAAGAGAAAGGAGGTTCCAGTCTCCTTTTAACAACCAGATCTTGCATGA 
ACTCATTACCTAATCATGGGGAATCTGCCCCTATGACCCAAACAGCACCCACC 
AGGCCCCACCTCCAACGCTGGGGATCACATTTCAACATGAGAGTTGGAGGGG 
ACAAATATCTGAACTCTATCAATGTTATATCCAAGAAATCATAGCCAAATCCA 
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ATGTCATGAAGCTTTTGCCCTATGTTTTCTrCTAAGATTTTTATAGTTTTACG^ 

tgcacatttagatatttgatccattttgagttaatttttgtaTatggtgtgaag 

TAAGGGTCCAGCTTCCCTCTTGCATGTGGATTTCCAGTTTTCCCAGCACCATTT 
GTTGAAAAGACAGTCCTTTTCTCATTGAATAGTCTTGGAACCCTAGTCAAAAA 

5 TCATTTAACCATTTATGCCACGGTTTATTTCTGGGCTGTCTCATGTTCCATTGG 
TTTATATGTCTGTCTTTITGCCTGTGTCACACTGTCnTGATTACTGTAAGTTTAT 
AGTAAAGTTTTGAAATCAGAAAGTGTTAGGCTTCCAGCTTGGCTm 
AATATTGCTTTTGGCTATTTTAAGTTCCTTGAGATTCGATATGAATTTTAGGAT 
GGGTTTTTCTATTTTTGCAAAAAGTGTCATTGGGATTTTGAGAGGGATTGCAC 

10 TGAATCTATAAATCACTTTGGGTAGTATTGATATCTCAACATTGTCITCTAATC 
CATGAACATGGGATATGTTTCCATTTATTTGCATCnTITTAAGm 
AATGTTTTGTGGGTTTTGCTGCAGAAGTCTTTTACCTCCTTGGTTAAGTTAATT 
CCTAAGTATTCTACTrGATGCTATTATAAATTAAATTGTTTTCTTAATTTCATT 
TTTAGATTGTTCATTGTTAGCATATAGAAATGCAACTGATTTTGCATGTTTACC 

15 TTAGATTCTGTAACTTTGCTGACTTCATTTATTAGCTCTAATAAGTTTTTGTGG 
AATCTrTAGGGGTTTCTACATATGAGATTATGTTATCTGTGAATAGAAATAGT 

TTTACGT C ITIT 1 f TTCCATTCTGGATGCCTTTTATTTCTTTTAAl 1 1 H l hji 1G 
CCAAATTGCTCTGGCTAGAATTTTCAGTACAATGTTGAATAGAAGTGGTGAAA 
GTGGTCACCCTGGCCTTATTCCTGACCTTAGAGGGAAAAATTTCAGTCTCATC 
20 ATTATGATGTTAGCTGTGGATTTTCACATATGGTTTTTATTATGTTAAGGTAGT 
TTCCTrCTATTCCTAGTTTGTTGAGAGTTTTTAATAAGGATGTTGAATTTTGTC 
AAATGCTTTTTGTATCAATTGAGATCATGTGATCATGTGAGTGTTATTTCTTCA 
TTCTGTTAATATGCTCTTTTACATGGATTTATTTTTGTATGTTGAACCATCCTT 
GCATTCCAGGACTAAATCTCACATGGTCATGATATGTAATACTTTAACTATGT 

25 rrCTGAACTCAGTTTGCTATTTTGTTGAGAATTTTGCATCAGTm 

ATATTGGTCTGTAGTTTTCTTGTAGTCTTTGTCTGGCTTTAGTATCAGAGTAAT 
GCTCTCCTCATAGAATGAGTTAGAAACCTTTCCCCTCCTCTTTAATTTTTTGGG 
AAAGATTGAGAAGGATGGGAGTTTATTCTTTAAATATTTGGTAGGATTCATTG 
GGGAAGCCATCAGGTCC^VGGCTTTTCTTTGTTGAGAGTTTTTTGATTACTGA^ 

30 CAATCTCCTCACTAGTTATAGGTCCATGAAGATmGTATTTCTTTGTGATTTA 
GTTTTGGTAGGTITCGTGTTTCTCAGAATTTGTCCATTTCATCTAGGTTATCCA 
ATTTGTTGATACACAATTGTTCACAATACTCTCTTATAATCCTTTTTATTTCTG 
TAGAATTGATAGTCATGTCCCTATITTCACTTCTGATTTCAGTAGTCTTCTCTC 
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TTTTTTACTTAGTCCATCTAGCTAAAGGTTTGTCAATT(nTAAAATCTTTTCAG 

AGAAGCAACTTTGATTTAATTGATTTTTCTCTATTGTTTTTCTATTCTCTATTTT 

GTTGGTCTCTGGTGTGGTATTTATTATCITCTTTCTTGTGCTAGCTTTGGGTTTA 

GTTCTTTTGCTAGTTCCTTAAGTGGTAAAGTTAGGTTATTTACTGGAGATCTTT 

CTTGTTTATTAATGTAACAATTTATAGGGAGGGAGTGGAGCAAGATGGCCAA 

ATAGAAGCCTCCAATTGTCCTCCCCAACAGGAACACCAAATTTGAGAACTAT 

CTACACAAAAAAGCACCTTCATAAGAACCAAAATTCAGATAAGCAATCACAG 

T A TnTOPTTTT A A rTTT AT ATTGCTG GAAGAGGCACTGAAGAGGGCAAGAG A 

GACAGTACTGAATCACTGACAGCACTGTGCCTCCCATCCCCCAGCTATGTGGC 

ATGGAGAGAGAATCTGAGAGCTTGGGGGAGGGAAAGTGCAGTGCTTTGGGA 

CTGTCCATCGAGCTCAGTGATTCCCTGTTGCCACAGAAAGCAGAACCAGGCT 

AAATTCAGCCCACACCCAGCACAGAGGGAGCATTTAAACCAGCCCTAGCCAG 

AGGGGAATCACATATCTCAGGGGTCAGAAACTGAGTTCCAGAAAACCTTGCC 

ACCACGGGCTAAAGTGCTCTGGGGTTCTAAATAAACTTGTAAGGCAGTCTAG 

GCCACAATAACTGCAATTCCTAGGCAAGTCCTATTGCTGAGATGGGCTCAGA 

GCCAGTGGATGTGGGGGGCACACAACCTAGTGAGACACCAGCCAGGGTGGCT 

AACCCCAGGGAGCACAGCTCACAGAAACAAAAGTGACTCTTTTCCTCTGCTTA. 

AAGAGAGGAGATGGAGCAGTAAAGAGAACATTGTCTTGCATCTTGGATACCA 

GCTCAACCACAGTAGGAGAGGACACTGGGCAGAGTCAATGGGCACACATTTC 

AGGCCCTGGCTCTTGGACATTTCCAGACACACCCTGGGCCAGAAGGGAATCC 

ACTGCCTTGGAGGGAAGGACCTAGTCTTGTCAGAATTCATCATCAGCTGACTA 

AAGAGCCCTTGGACCCTGAACAACCAGCAGTGCCAGGTAGTATCCTGTGGGC 

CTTGAATGAGATTCTGAAATATACTGGCTTCAGGTACCAGCTTGACCACAGTG 

GGGTAGAGCACCAAATGGGCTCTTGGGGTCCCTGATTCCAGGCCTTGGCTCTT 

GGACAGCATTTCTGGACCTGCTCTAGGCCAGAAGGGAGCCCACTGACCTGAA 

AGGTGAGTCCACAGCCTAGAAGCCTTCACCACAAGCTGACTGAAGAGCCCTC 

AGGCCTTCAGTGGACAACCATGGTAGCCTGACAGTACTGCCTATGGGCCTGT 

AGTGGTGGTGGCCACAGGGAGAGGCTCCTTTGCCTATGGAAAGGAGAAGGAA 

TTCTGGGTATGACTTTGTCTTGTGCTACTACTGAi jTGCCAGCTTAGCCTCTGTA 

GTATACCACACCAGGTAGGTTTCTAAGGTCTTTGACTACAGACCCTTGCTCCT 

GGATGGCATCTCTGGACCCACCTTGGACTTGGGGGAACTCACCACCCTAAAG 

TGAAGGACAAAAGCCTGGCTGGCTTTACCATTTGCTAATTGTAGAGCCCTGGG 

GCTTTGAGTGAACATAGGTGGTAGCCAGGTAGTGCTTACAGCGGGCCTTGGG 
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TGAAACCCAGTGATGTGTTGGCTTCAGGTCTGACCAGCACAGTCCCAGTGGTG 
GTGGCCACAGGTAAGCTTGTGTCACTCCTCCCCAAGCTTCAGGTGGCTTAGGA 
TAGAGAGAGAGTCCATTTCTTTGGGAGAAAGTAAGAGAAGAAAACAAGAGT 
CTCTGCCTCGTAATGCAGAGAATTCTTCTGGATCTTATCTAAGACCACCAAGG 
5 CAGTATTTTTTTTAATATTTATTTTTTGAAACAGAGTTTCACTCTTGTTGCCAA 
GGCTAGAGCGCAATGGTGTGATCTCAGCTCACCGCAACCTCTGTCTCCCGGGT 
TTAAGCGATTCTCCTGCCTCAGTCTCCTGAGTAGCTAGGATTATAGGCATGTG 

CCATCATGCCTGGCTAATTTTTGTTTGTT^ 

ACA^GGTTTCTCCATGTTGGTeA^Ge T GG TCTTGA AeTGGGGACGTCAGGTGA 

10 TCTGCCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGCG 
CCTGGCCCCAAGGCAGTATTTTTATGAGTCTGCAAGAACCACAGTGTTACTGG 
GCTTGGGATGCTGCCTAATGCAGATACGGCTTAGATCACAACACCCAAGTCG 
CTTTGAATACCTGGAAAGCTTTACCAAGAAAGATGGGTACAAACAAGCCCAG 
ACTGCAAAGGCtACAATAAATACCTAACTCTTTAATGCCTAGACACCAATGA 

1 5 AC AAG ACC ATCTAGGAAAACATGACCTGACCAAACAAACTAAATAAAGCATC 
AGGGACCAATCCTGAAGAAAAGAGATATGTGACCTTTCAGAGAGAGAATTTA 
AAATAGCTGTTATGAGAAAACTCAAATTCAAGATAACACAGCGAAGGAATTC 
AGAATTCTATCAGATAAATTTAATAAAGAGATTAAAATAAGAATCAAGCAGA 
AATTCTGGAGTTGAAAAATGCAGCTGACATATTGAAGAATGTATCAGAATCT 

20 CTTGATAACAGCATTGATCAAGAACAAAAAAGAATTAGTAAGCTTGAAGACA 
GGATATTTGTAAATACACAGAGGAGACAGAATAAGAACGAATGAAGCATGA 
TCTAGAAAATAGCCTCAAAAGGGCAAATCTAAGATTTATTGACCTTAAAGAG 
GAGGTAGAGAAAGAGATAGGGGTAGAAAGCTTATTCAAAGGGATACTAACA 
TAGAACTTCCCAAACCTAGAGAAAGATACCAATATTCAAATACAAGAAGATT 

25 CTAGAATACC AAGC AGATTTAACCCAAAGAAGACTACCTCAAGGTATTTAAT 
AGTCAAAGTTCCAAAGGTCAAGGATAAAGAAAGGATCCTAAAAGCAGCAAG 
AAAAAAAAGACTTATAATTGAACTCCATTATATCTGGCAGCAGAGTTTTCAAT 
GGAAACTTTATGGGCCAGAAGAGAGTGK3CAGGACATATTTAAGGTGCTGAAG 
GAAAAAAAAGAAAAAAACTTTTGCTCTAGAATAGTATATCTGGTGAA/i \TAT 

30 CCirCCAACATGAAGGAGAAATACAGACCTTCCCAGACAAACAAAAGCTGAG 
GATGTCATCAACACCAGAGCTGTCTTACAAGAAATGCTAAAGGGAATTCTTC 
AATCTGAAAGAAAAGGGTGTTAATAAGCAACAAGAAATGATGTGAAGGTAC 
AAAACTCACTGGTAATACATACACAGAAATACAAATAGTATTATAACACTGT 
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AATTATGGTGTGTAAACTACTCATATATTAAGTAGAAGGATGAAAAGAAGAA 
CTGATAAATAATGACTACAACAACTTTTGCGGTACAATAGGCAGTACAATAA 
GATATAGAGACAAAAAAACTTAAAAAGCGGAGAGACAAAAGTTGAAGTGTA 

5 TGTTACATACGTATACAGTGCCATGGTGGTTTGCTGCACCTATCAACCCATCA 
TCTAGGTTTTAAGCCCTACATGCATTAGGTATTTGTCCTAATGCTCTCCCTCCC 
CTTGGCCCCCATCTCCCCACAGATCCCGGTGTGTGATGTTCCCCTCCCTGTGTC 
C^T^TATTCTOATTG TTCAACTTCCACTTATGAGTGAGAACA TATGGTGTTTG 
GTTTTCrGTTCCTGTGTTAGTTTGCCGAGAATGGTTTCCAGCTTCATCCCCACA 

GTGCCACATTTTCTTTATCCAGCCTATCATTGATGGGCATTTGGGTTGGTTCCA 
AGTCTTTGCTATTGTAAATAGTGGTGCAGTAAACATATGTGTGCATGTGTCTT 
TATAGTAGAATGATTTATAATCCTTTGGGTATATACCCAGTAATGGGATTGCT 
GGGTCAAATGGTATTTCTAGTTCTAGAATCTAGTTCTAGAATCACCACGCTGT 

15 CTTCCACAATGGTTGAACTAATTTACACTCCCACCAACAGTGTAAAAGCGTTC 
CTATTTCTCCACATCCTCTCCAGCATCTGTTGTTTCCTGATTTTTTAGTGATCG 
CCATTCTAACTGGCATGAGATGGTATCTCCTTGTGGTTTTGATTTGCATTTCTC 
TG^ATGACCAGTAATGATTAGCTTTTTTITCCATATGTTTCTTGGCCACATAAAT . 
GTCTTCTTTTGAGAAGTATCTGTTCGTATCCTTTACCCACTTTTTGATGGGGTT 

20 GTTTTTTTCTTGTAAATTTAAGTTCCTTGTAGATTTTGGATATTAGAC 

AGATGGATAGATTGCAAATATTTTCTCCCATTCTGTAGGTTGCCTATTCACTCT 
GATGCTAGCTTCTTTTACTGTGCAGAAGCTCTTTAGTTTAATTAGATCCCATTT 
GTCAATTTTGGCTTTTGTTGCAATTGCTCTTC 

GCCCATGCCTATGTCCTGAATGGTATTGCCTGGGTATTCTTCTAGGGTTTCCGT 
25 GGTTTTAGGTTTTACATTTAAGTCTTTAATCATCTTGAGTTAATTTTTATATAA 
GTTGTAAGGAAGGGGTCCAGTTTCTGTTTTCTGCATATGACTAGCCAGTTTTC 
CCAGCACCATTTATTAAATAGAGAATCCTTTCCGCATTGCTTTTTTGTCAGGTT 
GGTCAAAGATCAGATGATTTTAGGTGAGTGGTGTTATTTCTGAGGTCTCTGTT 
CTGTTCCAiTGGTTTATATATCTGTCTTCATACCAGTACCATGCTGTTTTCGTT 
30 ACTATAGCCnTGTAGTATAGTTTGAAGTCAGGTAGTGTGATGCCTCCAGCTTT 
GTTCTTTTTACTTAGGATTGTCTTGGCTATATGGGCTCTTTTTTGGTTCCATAT 
GAAATTTAAAGTAGTTTTTTCTAATTCTGTGAAGAAAGTCAATGATAGCTTGA 
TGGGAATAGCATTGAATCTATAAATTACTTTGGGCAGTATGGCCATTTTCATG 
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ATACTGATTATTCCTATCCATGAGCATGGAATTTTTTTCCATTTGTTTGTGTCC 
TTTCCTTTCCTTCAGCAGTGGTTTGTTGTTCTCCTTGAAGAGGTCCTTCACATC 
CCTTGTAAGTTGTATTCCTAGGTATTTTCTTCT 

GGATTCACTCATGATTTGGCTCTCTGCTTGTCTATTTTTGGTGTATAGGAATGC 
5 TTGTGAATTTTGCACATTGATTTTGTATCCCAAGACTTTGCTGAAGTTGGCTAT 
CAGCTTAAGGAGTTTTTGGGCTGAGACAATGGGGTTTTCTAAATATACAATCA 
TGTCATCTGCAAACAGAGAAAATTTGACTTCCTCTTITCCTATTTGAATACTCT 
TTATTTTTTCGCTTGCCTTATTGTCCTGGCCAGAACTTCCAATACTATGTTGAA 
TGGGAGTGGTGAGAGAGGGCATCCTTGTCTTGTGCTGGTTTTCAAAGTGAATG 
10 CTTCCAGCHTITGCCCGTTCAGTATGATATTGCCTGTGGGTTTGTCATAAATAG 
CTCITGTTATTTTGAGATATGTTCCATCAATTCCTAGTTrATTGAGAG 
GCATGAAGGGATGTT^AATTTTGTCAAAGGCTTTTTCTGCATCTATTGAGATA 
ATCATGTGGTTTTTGTCATTGGTTCTGTTTATGTGATGTTTA^GTGATTTACATA 
TGTTGAATCAGCCTTGCATCCCAGGGATGAAGCCCACTTGATCACCATGGATA 

15 AGCITTTTGATGTGCTGCTGGATTCAGTTTGCCAGTATTTrA 

GCATCGGTGCTCATCAGGGATATTGGCCTGAAGTTTTCTTTTITTGTGTGTCGC 
TGCCAAATTTTGGTATCAGGATGATGCTGGCCTCATAAAATGAGTTAGGGAG 
GGGTCCCTCTTTTTCTACTGTTTGGAATAGTTTCAGAAGGAATGGTACCAGCT 
CCTCTTTGTGCCrCTGGTAGAATTrGGGTGTGAATCCGTCTGGTCCTGGGCTTT 

20 TTTTGGTTGGCAGGCTATTAATTACTGCCTCAATTTCAGAACTTGTTGTTGGTC 
TATTCAGGGATTCGACTTTTTCCTGGTTTAGTCTTGGAGGGTGTATGTGTCCAG 
GAATTTATCCATTTTTTCTAGATTTTCTAGm 

GTATTCTCTGATGGTAGTTTGTATTTCTGTGGGATCAGTGGTGATATCCCCTTT 

ATCATTTITrATTGCATCTATTTGATTCTTCTCTCTTTT 
25 GCTGGCAGTCTATTTTGTTAATCTTTTCAAAAAACCAGCTCCTGGATTCATTG 

ATTTTTTTGGAAGGGTTTTTCATGTCTCTATCTCCTTCAG 
AGTTATTTCTTGTCTTCTGCTAGCTTTTGAATTTGTTTGCTCTTGCTTCT 
TCTTTTAATTGTGATGTTTGGGTGTCGATTTCAGATCTCTCTAGCTTTCTGATA 
TGGGCATITAGTGCTATAAATTTGw'CTCTAAACACTGCTTTAGCTGTGTCCCA 
30 GAAATTCTGGTACGTTGTCTCTTTGTTTTCATTTGTTTCAAAGAATTTCTTT ATT 
TCTGCCTrGATTTTGTTATTTACTCAGTAGTCATTCAGTAGCAGGTTGTTCAAT 
rrCCATGTATCTGTGTGGTITTTAGTGAGTTTCTTCATCCTGAGTTCTAAm 
ATCGTACTGTGTTCTGAGAGACTGTTrGTTATGATTTCCATTCTTTGGCATTTG 
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CTGAGGAGTGTTTTACTTCCAGTTATGTGGTCAATTTTAGAATAAGTGGTATG 
TGATGCTGAGAACAATGTACATTCTCTTGATTTGGGGTGGAAAGTTCTGTAGA 
TGTCTATTAGGTCCGCtTGGTCCAGAGCTGAGTTCAAGTCCTGAATATCCTTG 
TTAATTTTCTGTCTCATTGATCTGTCTAATATTGACAGTGGAGTGTTAAAGTCT 

5 CGCACTGTTATTGAGTAGGAATATAAGTCTCTTTGTAGGTCTCTAAGAACTTG 
TTTTATGAATCTAGGTGCTCCTGTATTGGGTGCATTTATTTAGGATAGTTAGCT 
CTTCTTGTTGCATTGATCCCTTTACCATTATGTAATGCTCTTCTTTGTCll 11 II 
TATCTTTGTTAGTTTGAAGTCTGTTTTATCAAAGGCTAGGATCTTGGCCGGGTG 
CAGTGGCTCATGCCrGTAATCCCAGCACTTTGGGAGGCTGAGGTGGTTGGATC 

10 ACGAGATCAGGAGATCGAGACCATCCTGGCTAACACGATGAAACCCCATCTC 
TACTAAAAATACAAAAAAATTAGTCAGGCATGGTGGCGGGCACCTGTAGTCC 
CAGCTACTCGGCAGGCTGAGGCAGGAGAATGGCGTGAACCCGGGAGGTGGA 
GCTTGCAGCGAGCCAAGATAGCGCCACTGCAGTCTGGCCTGGGCAAAAGAGT 
GAGACTCCATCTCAGAAAAAATAAAAAATAAAAAATAAAATTAAAAAAAAT 

15 GGCTAGGATC AC AACCCCTGCTTTTTTTTTTTTTTTTTTTT^ 

TGCTTGGTAAATATTCCTCCATCCCTTTATTTTGAGCATATGTGTGTCTTTGCA 
CATGAGATGGATCTCCTGAATAAAGCACACTGATGGGCCTTAACTCTTTATCC 
AGTTTGCCAGTCTGTGTCTTTTAATTGGGGCATTTAGCCCATTTACATTTAACG 
TTAACATTGTTATGTGTGAATTTGGTCCTGTCGTCATGATGCTAGCTGGTTATT 

20 CTGCACATTAGTTGATGCAGTTTCTTCATAGTGTCATTGGTCTTTATATTTTGG 
TGTGGTTTTGCAGTGGCTGATACTGCTTTTTCCTTTCCATGTTTAGTGCITCCT^ 
CAGGAGCTCTTGTAAGGCAAGCCTCAGCATTTGCTTGTTTGGAAAGGATTTTA 
TTTATCCTTTACTCATGAAGCTTAGTTTGGCTGGATATGAAATTCTGCTTGAAA 
ATTCTTTTCTTTATGAATGTTGAATATTGGCTCCCACTCTCTTGTGGCTTGTAG 

25 GGTTTCTGCAGAGAGATTTGCTGTTAGTCTGATGCACTTCTCTTTGTAGGTAA 
CCTGACCTTTCTCTCTGGCTGCCCTTAACGTTTTGTCCTTCATTTAAACCTTGG 
AGAATCTGACAATTATATGTCTTGGGGTTGCTCTTCTTGAGGAGTATCTTAGT 
GGTGTTCTCTGTATTTCCTGAATTTGAATGCTAGCCTGTCTTGCTAGGTTGGGG 
AAGTTCTCCTGGATAATATCCTGAAGTGTGTTTTCCAAC.TGGTTCCATTCTCC 

30 CCGTCACTTTCAGGTACAGCAATCAATTGTAGGTTTGGTCTTTTCATATAGTCC 
CATATTTCTTGGAGGTTTTGTTCGTTCTTTTTCATCATTTTTTCTCTAATC 
CTGCATGCCTTATTTCGTTAAGTTGATCTTCAATCTCTGATATCCTTTGTTCCA 
CTTGATTCATTTGGCTATTGATACTTGTGTATGCATCACAAAGTTCTCATGCTG 



WO 02/054069 PCTAJSOl/49808 

218 

t 

TGTTTTTCAGCTCCATGAGGTCATTTATGTTCCTCTCTAAACTGGTTATTCTAG 
TTAGCAGTTCCTGTAACCTTTTATCAGGTTCTTAGCTTCCTTGCATTGGGTTAG 
AACATGCTTCTCrAGCTCAGAGGAATTrGTTATTACCCACCTTCTGAAGCCTA 
CTTCTGTCAATTCGTCAGTCTCATTCTCCGTCCAGTTTTGTGCCCTTGCTGGAG 
.5 AGGAGATGTGATCATTTGGAGGAGAAGAGGCATTCTGGTTTTTAGAATGTTC 
AGCATTITTGTGCTGGTTTITCCTCATCTTrGTGGAm 

TGAGGCTGATGACCTTTGGATGGGGTTT^GTGTGGGGGGTCCCTTTTGTTGA 
TGTTGATTTTGTTGCTTTCTGTTTGCTAGCTTTTCTTCTAAGAGTCAAGCCTGC 
~ AGTCCTGCAGGTCTGCTGCAGTTTGCTGGAGGTCCACTCCAGACCCTGTTTGC 
10 CTGGGTATCACCAGTGGAGGCTGCAGAACAGCAAAGATTGCTGCCTACTCCT 
TCCTCTGGAAGCTTTGTCCCAGAAGGGCACCAGCCTGATGCCAACCAGAGCT. 
CTCCTGTTTGAGGTGTCTGTTGACCCCTGTTGGGAGGTCTTGCCTGCTCAGGA 
GGCATGGOGCTCAGGGACCCACTTGAGGAGGCAGTCTGTCCCTTAGCAGAGC 
TCATGCACTGTGCTGCGAGAATCCCCCTTGTCAGGATCAGCTGCTCTCTTCAG 
15 AGCCAGCAGGCAGGAGAGTTCAAGTCTGCTGAAGCTGTGCCCACAGCCACCC 

cttctccaggtgctctgtcccagggaaatgggagttttatctataatcccctg 
actggggctgctgcctttctttcagagatgccctgcccagtgaggaggaatct 
agagaagcagtctggcctcagctgctttgctgtgctttggtgaat^ 
gtccaagcctcccagcctccttagcactatcaggggaaaaccgcctactaaa 

20 GCCTCAGTAATACCAAATGCCCCTCCCGCCACCAAGCTTGAGCATCCCAGGTC 

aacttcagactgctgtgctggcagtgagaatttgaagccagtggttcttagct 

TGCTGGGCTCCATGGGAGTGGGACCCACTGAGTGAGACCACrTGGCTTGCTG 

GCCTCAGCCCCCTTTCCAGGGK3AGTAAATGATTCTGTCTTGCTGGGGTTCCAG 

GCACCACTGGGGTACAGAAAAAACTCCTGCAGCTAGTTCAATGTCTGTCCAA 

25 • ACAGCAACTCAGTTTTGTGCTTGAAACCCAGGGCCCTGGTGGTATAGGCACAT 
GAAGGAGTCTCCTGATCTGCAGATTGCAAAACCCATGGGAAAAGCATAGTAT 
CCAGGCCGGGTATCACAGTGCCGGGTATCACAGATGACGGGTTGATGGGTGC 
AGCAACTGCATGCAAACTGCATGTTTGCAGTTCCCTCGGCTGGGGGAGGGAG 
. GTCTCCCGGCTCCTTGCACTTCCCAGGTGAGGCGATGCCCCACCTGCTTCTGT k 

30 GTGCCCTCCGTGGGCTGCACCCACTGCCTAATCAGTCCCAATGAGATGAACTT 
GGTACCTCAGTCAGAAATGCAGAAATCACCCAGTGTTGATCTCACTGACAGC 
TACAGACTGGAGCTCTTCCTATTCTGCCATCTTGCCAGATGTAAAGTGTAGAA 

TTTTTGATTITGTTTTTTTGCTTGTTTATGCAATT 
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TTAAAATAATGAGTTGTAAGATATTTTCAAGCCTCATGGTAACCTCAGATCTA 
AAACATATGGCAGACACACAAATAAAAAGCAAGAAATTAAAGCATGCCACT 
AGAGAAAATCACCTTCACTAAAAGGAAGACAGAAGGGAAGTAACAAAGGAA 
GAGAATATCTTCACCAGAAAACAAATAATAAAATGGCAAGAGTGAGTCCTTA 
5 CGTATCAATAATAATATTAAATGTAAATGGGCTAACCTTGCCAATCAAAAGA 
CAGAGTGGTTGAATGGACCCCCGCCCCCACAAAAAAAGACCCAATTATCTGT 
TGCTTACAAGAAACACATTTCAACCTATAAAAATACACATAGACTGAAAATT 
AAGGGCTTGAAAAAAGATATTCCGTGCCAATAGAAACCAGAAAAGATCAGG 
AGTAGCTATACTTGTATTAGGCAAAGTAGAGTTCAAGATAATAACTATAAGA 

10 AGAGACAAAGAAGGTCATTACATCATGATAAAGGGATCAATTCAGCAAGACG 
ATATAACAATTTCAAAGAAACATCTAACTTAATCTGCACTGTAGACCAAATAT 
ACCTAATAGATATTTACAGGACATTTCATCCAGCAATTCCAAGAGAAGAATA 
CAC ATTCTCCTC AGC ACAT AGCTC ATTCTCAAGGATAG ACCATATGTT AGATC 
ACAAAACAAGTCTTGAAACATTCTAAAAAATTGAAATACTATCAAGCATCTT 

1 5 CTCTGACCAGTACAGAATAAAACTAGAAGTTAATAATGAGACATTTTGGAAA 
CTATACAGACATACACATAGTAATTAAACAATATGCTCCTGAATGACCAGAG 
GGTCAAGGAAGAAATTAAGAAGGAAATTGAAAATTTTATTGAAATAAATTAC . 
AATGGAAACACAACTTACCAAAACCTGTGGGATTTGTTTGTGTCTTTTTTACT 

TCCTTGAGCAGTGGTTTGTAGTTCTCCTTGAAGATATGAACAGACACTTCTCA 
20 AAAAAAAGACATTTATGCAGCCAAAAAACATATGAAAATGTGCTCATCATCA 
CTGGTCATTAGAGAAATGCAAATCAAAACCACAATGAGATACCATCTCACAC 
CAGTTAGAATGGCGATCATTAAAAAATCAGGAAACAACAGATGCTGGAGAG 
GATGTGGAGAAATAGGAACGCTTTTACACTGTTGGTGGGAGTGTAAATTAGT 
TCAACCATTGTGGAAGACAGTGTGGCAATTCCTCAAGGATCTAGAACCCAAA 
25 TACCATTTGACCCAGTGATCCCATTACTGGGTATATACCCAAAGGATTATAAA 
TCATTGTATTATAAAGACACATGCACACATATGTTTATTGCACCACTATTCAC 
AATAGCAAAGACTTGGAACCAACCCAAATGCCCATCAATGATAGGCTGGATA 
AAGAAAATGTGGCACATATACACTGTGAAGTACTATGCAGCCATAAAAAAAG 
ATGAGTTTATGTTarrTTCAGGGACACGGATGAAGCTGGAAACCATCATTCTC 
30 AGCAAACTAACACAAGAACAGAAAACCAAACACCACATGTTCTCACTCATAA 
GCGGGAGTTGAACAAGGAGAACACATGGACACAGGGAGGGGAACATCACAC 
ACCGGGGTCTGTTGGGGGGTGGGGGGCAAGGGGAGGGAGGGAATTAGGAGA 
AATACCTAATGTAGATGACGGGTTGATGGGTGCAGCAAACCACCATGGCACG 
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TGTATGCCTATGTAACAAAACTGCATGTTCTGCACATGTACCCCAGAACTTAT 
AAAAATCCTGTGGGATACAGCAAAAGCAGTATTAAGAGGGAAGTTTCTAGTT 
AAAAGCACCTGCATCAATAAAGAGAAAAACTTCAAGTAAACAACCTAATGAT 
CTTAAAGAACAAGAAAAGCAAGAGTAAGTCAACCTAAAATTAGTAGAAGAA 
5 ATAATAAAGCTTAGAGCAGAAATACATGAAATTGAAGTGAAGAAAACAAGA 
CAAAAGATCAACAAAATAAAAAGTTGGTX'lTIllllITAAAAGATAAAATTGA 
CAAACCTTTAGCCAGACTAAGAAAAGAAGAGGGAAGATCCAAGTAACTAAA 
GTCAGAGACAAAAAAGGAGACATTACAACTGATACCACAGAAATTCAAAGG 
ATAATTAGTGGCTGCTATGAGCAACTCTATGCCAATAAATTGGAAAATCTGG 
10 AAGAAATGGATACATTCCTAGACATGTGCAACCTACCAAGATTGAACCAGGA 
AGAAATGCAAACCTGAACAGACCACTAAGCAGTAATGAGATTGAACCTGTAA 
TTTAAAAATCTTCCAACAAAGAAAACCCTGAGACITGATGGCCTCACTGCTAA 
ATTCTGCCAAAGATTTCAAGAACTAATACTAATCCTTCTGAAACTATTATGAA 
AAATAGAGGAGGAGGGAATACCTCCAAACTCATCTACAAGGCCCGTATTACC 
15 CTGTTACCAAAACCAGACAAAGATACATCAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAGGAACCACGGGCCAATATCCCTGATAAATATTGATGCAAAAATCC 
TCAACAAAATACTAGCAAAGCAAATTCAGCAACACCTTAAAAAAGATCACTC 
TTCATGACCAAGTGGGAGTTATCCCAGGAATGCAAGGATAGTTCAACATATG 
CAAATCACTTTATGTGATACATCGTATCAACAGAATGAAGGACAAAACCACA 
20 TGAGCATTTCAATTGATGCTGAAAAAGCACTTGATAAAATTCCACATCCCTTC 
AACATAAAAACTGTCAAAAACCTAGGTATAGAAGGAACATACCTCAACATAA 
TAAAAGCCATATATGACAGACCCACAGCTAGTAACATACTGAATGGGGAAAA 
ACTGAAGGCCrTTCCTCTAAGACCTGGAACATGACAAAGATGCCCACTTTCAT 
CACTGTTATTTAACACAGTAGTGGAAATCCTAGCTAGAGCAATCAGACAAGA 
25 GAAGAAAATAAAGGGCATCCAAGTTGGAAAGAAAGAAGTCAAATTATCCTTG 
TTTGCAGATGATGTGGTCTTATATTTGGAAAAATCTAAAGACCCCACAAAAA 
AACTATTAGAACTGATAAATTCAGTAAAGTTGCAGGATACAAAGTCAACATA 
CAAAAATCAGTAGCATTTCTATATGCCAAGAGTGAACAATCTGAAAAACAAA 
TCAAGAAAGTAATCCCATTTAAAATAGC^\CAAATAAAATACCTAGGAATAA 
30 ACITAACCAAAGAAGTGAGAGATCTCTACAATGAAAATTATAATACACTGAT 
GCAAGAAATTGAAGAGGACCCAAAAATGGTAAGATATTTCATGTTCATGGAC 
TGGAAGAATCAATATTGTTAAAATGTCTGTATTGTCTAAAGCAATCTACAGAT 
TCAGTGCACCCCCTATCACAATACCAATGACATTCTTCACAGAAATAGTAAAA 
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ACACTCCTCAAACTTATATGGAATCACTAAAGTCCCAGAATAGTCAAAGCTAT 

CCTGAGCAAAAGAACAAAACTGGAGGGAGCACATTACCTGAGTTCAAATTAT 

ACTACAGAGTCATAGTAACCAAAACAGTATGTACTGACATAAAAACAGACAC 

GTAGACCAATGGAACAGAATAGAAAAGCCAGCAAACAAATCCATATGTCTAC 

GACAAACTCATTTTTGACAAAGGTGCCAAGAACATGCATTGGGAAATGGACA 

GTCTCTTCAATAAACGATGCTGGGAAAACTGGATGTCCATATGCAGAAGAAT 

GAAACTAGACCCCTATCTCTCACCATATAAAAAAAACAAATCAAAGTGAATT 

CAAGAATTAAATCTAAGACTTCAAACTATGAAACTACTAAAAAGACATTGGA 

GAAACTCTCCAGGACATTGGAGTGGGCAAAGACTGCTTGAGTAATCCCTACA 

AGCACAGGCAACCGAAGCAAAAATGGACAAATGGGATCACATCAAGTTAAA 

AAGCTTCTGCACAGCAAAGGAAACAATGAGACAGCCTGCAGAATGGGAGAA 

AATATTTGCAAAGTGCCCATCTGACAAGGGACTAATAACCAGAATATGTAAG 

GAGCTCAAACAACCGCACAAGGAAAACATCTAATAATCCAATTAAAAGATGG 

GCAAAAGACCTGAATAGACATTTCTCGAAAGAAGACAAATGGATGAAACTGG 

AAACCATCATTCTCAGCAAACTATCGCAAGGACAGAAAACCAAACACCGCAT 

GTTCTCACTCATAGGTGGGAACTGAACAATGAGAACACATGGACACAGGAAG 

GGGAACATCACACACTGGGGACTGTTGTGGGGTTGGGGGGAGGGGGGAGGG 

ATAGCATTAGGAGATATACCTAATGCTAAATGACGAGTTAATGGGTGCAGCA 

CACCAACATGGCACATGTATACATATGTAACAAACCTGCACATTGTGCACAT 

GTACCCTAAAACTTAAAGTATAATAATAATAAAAAAACAAATGACAAATAAA 

TAAATGAAAATGTACTCAACGTCACTGATCATCAGAGAAATGCAAATGAAAA 

CTATAATGAGATATGACCCCACCCTGGTCAATATAGCTTTTATCCAAAAGATA 

GGCAATAATGGATGCTAGTGAGGATGTGGAGAAAAGGGAATCCTTGTACACT 

GTTGGTGGGAATGTAAGTTACTACAACCACCGTGGAGGACAGTTTCGAGGTT 

CCTCAAAACTAAAAATAGAGCTACCTCTGATCCAGCAGTCCCACTCTAGTTAT 

ATATCCAGAAGAAAGAAAATCAGTCCAGCAGAGAAGGATCCGCACTCCTGTG 

TTTATTGCAGCACTATTCACAATAGCCAAGATTTGGGAGCAAACTGAGTGTCC 

ATCAACAGATGAATGGATAAAGAAAATGCTGTGTATACACGCAACGGTATAC 

TAGTTAGCCATGAAAAAGAATGAGAACGTGTCATTTGCAACAAi'ATGGATGG 

AACTGGAGGTCACTATATTAAGTGAAATAAGCCAGGCACAGAAAGACAAACT 

TTGCATGTTCTCACTTATTTGTGGGAGATAAAGATTAAAACAATTGAACTCGA 

GATAGAGTAAAAGGATGGTTACCAGAGGCTGGGAAGGGTAGTGGGGGTTGT 

GAGGGAAGTAGAGATGGTTAATGGAAACAAAAAAAAAATAGAAAAATGATT 
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AAAACCTGGTATTTGATAGCACAACAGGGTGACTATAGTCAATGATTTAATTA 
TACATTTAAAAATAACTAAAAGAGAGCTGGGCACAGTGGCfCACAGCTGTAA 
TTTCAGCACTTTGGGAGGCCGAGTCATATGACTCACTTAAGGCCAGGAGTTCG 
AGGCCTGCCTGGCCAACATGGTGAAACCCCCTTCTCTACAAAAAACAGGAAA 

5 ATTAGCTGGACTTGGTGGCATACACCTGTAATCCCAGCTACTTGGGTGGCCGA 
GGCATGAGAATCACTCGAACTCAGGAGGCGGAGGTTGCAGTGAGCTGAGATC 
GCGCCACTGCACTCCAGCCTGGGTAACAGAGTGAGACTGTCTCTAAATAAAT 
AAGACGAAAAGAGTATAATGAGGTTGTTTGTAAGACAAGGGGTAAATGTTTG 
AGGTAGTAGATACCCCCATTTACCTCTATGTGATTATTATGTACTGTCTGTCTA 

10 TGTTAGAATATCTCATGGACCCCATAAACCTATACACCTACTATGTACCCACA . 
AAAAGCAAAAAGAAAAAGAGCTTATGGGTATCAATTTTTCTGTTAGTAACGC 
TTTTGTTGTGTCCCAGCAAGTTTTGGTACATTGTCTTCTCATCTTTAm 
TAAGTATTTTCTAATTTCCCTTCTGATTTTTTTTTTTTTTAAm 
ACACATGTGCAGGACATGCAGGTTTGTTACGTAGGTAAATGTGTGCCACGGT 

15 GGTTTGCTGCACCTGTCAACCCATCACCTAGGTGTTAAGCCCTACATGCATTA 
ACTATTTATACTGATGCTCTCTCTCCCTCTGCCCCCCTTTTTCTTTGATTCATTG 

GTTAAGAGTATGTTGTTrCATTrCCATACATTTGTTAATTm 
TATTATTGATTTCTAAGCTTATCCCATTATCGTTGGAGAAGATCCTTTGTATGA 

CTGGCATCTTTTTGAGTACTTTGAGACITAATTCGTGGCCTGATCTCTCCT 
20 AACATTTCCACGTGCACTTGGGATGCACATGCATGCTGTGGTCATGGGTCGAG 
TGTTCTCTGCATGTTGGTTAGATCTAGGTGGCTTATTGTGTTAAGTCCTCTATT 
CTGTAACTTATCTTCTGCCTGGTTGTTCTATTCATTATAGAGAATGTGGTACTG 

AAGTCTCTCACCATCATTGTAGAACT^^ 

CCTTTGTGTCTTTTGATCGTTAGCTGTTAGAGGCATAAATGTTTATAATGGTTA 
25 TCTGTAACATCTTCTTGCAGCCTCGAACCTTTAAGTGTGCAGCGTCCTCTTTTG 
TCTCTTGTGACCTTrCTAGGTTTATTATAAAACCATTITrGGCTG 
TAGCCACTCTGCTTTTTGTGTTTGCCATTTGCATGGAACCTCTTTCTCCACTCT 

TTCCACCCTITCACTTTCAATCTTTTrGTGTCTTTGGCTCT 

AGT li gac agc atgt agctgg atc atgttt atct attctgccactgccgatatc 

30 ACCTTTTGATATACAGACnTrACATTTTCATAAAATTTAATTTGTCCACGTGCT 

tagcccctgtgatccagcaatcccactgctagttatatatccaaaaagaaag 

AAAATCAGTCCAGCAGGGACAGATGCGCACTCCTACGTTTACTGCGGCACTG 

ttcaccggagcctgtgccttcaatgtcgtattcaagaaatcatcctcaaatcc 
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AAAGTCTTTCTATTTTCTCCTATGTGTTCCTATTGCTTTAGGTCTTACATTTATA 

CCTTTGACCCATATTGAGTTAATTTTTTATATCCAGCTTTATTCTTTTGCATGT 

AGATACTCAGTTATCCCACCATCATTTGTCGAAAAGACTATACGTGAACTTTT 

TGAATTTGCTTTTCTCACGTAATAGTATTTCCTAGAGACCACTACATATCAGTT 

CGTGGGGTCCTTCCATATTCTTTTTTACAGCCACCCCATACTTTATGGTTGGAT 

GTGCCATGCTTATTCAAACCCTGTCCTATTTATGGGCATTTAGGTTGTTCTGAT 

ATTGTGCAATGATAAATAATGTTGCAAAGTATAGCCTTATACATATGTCTATT 

GTATTCTTGAAACAATGTTGCAAAGAATAACTTCGTAGGTATGGATGTTCCTA 

TTGTTGGAGGTGTTATCTTCAGGGTAGATTCCTAGAAATGGGAGTGTTGGGCT 

GAAAGAGAAGCCCGTAGTTTTGTTGGATGTGACCAAATTTCCCTGCAGAACG 

GTTGGACATTTGCACTCCCCCAGCAATGCATGAGCATACCGGTTTTTTCTACA 

GTTGCGCCAGCAGCGTGCTGTCACACCACTTTTTGCCAGGTCCATAGCTTTGT 

TTTAATCTGTATTTCTGTAATTACATTGAATTTGAATGTTTTCCCTGTGTCTAA 

AAGTCATGTGTGTGTGTATTTGTAAATCGTCTGTCTTCTCATTTCCAACTTCTT 

CCATTGGGTTTTTGGTTCTTGGTCTCTCAATTTTTAAGTCATTTATATTGTGGG 

AACGTTTGTCTGGACTAATGTGAATTGTTTTCCCAGCTGGTCACTTGTCTTTGA 

CTTTGCTTATGGTGCTTTTTGCTATGAGAAATTTCGCATGTTTATATCATCAAA 

TTTATCATGCTTCTTTCTACTGACTCCGGATTCAGGGCCACAGTTGAGAGCTTT 

TTCCTCCACCGAGGTTAAGGGGAATCCACTGTGTTTTCTTCTGGAGGTTTGGG 

GTCTTGTCITCAGCATTTAGCTCCTTGTCCGTTTGGAGTTTGCTCTTGTGCATG 

ATCTAAGGTAATCTTTTCTCAGAGAGTGGCCTGAGACCTCTTCCTTAGAAGTT 

CACCCCACTGGCTGGGGCCTTCACGAGGCCCCTCTGGCCTCTAAGTGGAGCAT 

GGAGTGTGGGGTCATCGGGCAGGACACGAGATGCGCAGGAGGCTGTGACTG 

AGGTGGCTGTGGGATGGGACGGCGTGGTTTGCTCTAGAGGGTTTGAAGGAAG 

GGTGAGAAGTCAGGACCACTGCTGTAGAAAACAGGGTTTGCAGGTAGATAAG 

GTGGCGGGTGCAGCGGGAGGAAGAGAAAAGGATGCTGCTGGGACCCCTCGC 

ACCTGGGAGGATGCACTTTCTGTTTCTGACATGGGTCAGGGTTGGGGCGAAG 

GAGTCCAGCTGGGGACATGTTATTTTGGAGGTGTTTGTGAGGCAGCCAGGTG 

AGGTGGGTTGAAGGCAGT^AGACCTGCAGGTGGGAGCTCTGGGGAGAGAGT 

GCGGGCTGGGCTCACAAGTTAGGGGAGGTGTCAGTGTGTAAATGACATGAGC 

GCAGGGGCTAGACAGGGCCTCCTGTAGCCTGGGTGGGTGGAAGGGCAGAGG 

GCAAGGGCCAAGTGATGGCAGCTGTGGAGGCTGAGGGCAAGGAAGGCAAGT 

CCGGGAGGTTGGGTGTCCTGCAGCCTCCACAGGGGCAATGGAGGAAGAACAA 
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AGGACTCTGTGTTGAGCCCTCCTGAGAGGTTTTGGAACTGGCCTTGACTTGGT 
GATGTGGGGTTATGAGTTTTATTCCACACACGTATGTGGGAGTCTGGGTCCTG 
GTGGGTCCCAGCCCCGTACATGGAGGCTGGGACCCTGCCCTGAAACCTCTCA 
GACACAAACACCCAGCrCGGCCCCACAGAGCTGGCCTGGGCCCCAGCAGGGG 

5 TCTGTGTTGCCTTGGCCCCACGGCAGTGGAGCTGGGGTTTCCCAGGGATGCTG 
CTGCTGAAGGTGAGGTTTGCAAACACTAGGCATGCATGTTTGAGGTACCACA 
ACCTCCACCAACCCATGGTGGTCACCAAGGGTCTTTGGGCTCCTGTGGGGACC 
ATACTGGCTTGTCTGACTTGCCGTTACCCGGCGGAGATATGTCTTGGAAAGAC 
TTTAGGCCAGGAGCCCATCATTGTCCTGGAACCTGGGCTTGGCAGGGCAAGC 

1 0 TTTGGGGAGC ACTGTCCAGACAGTGCCCTGGACACTGC ACCTCAGGTCAGGA 
CACAGCTCCCGAGGCCTGGCTGCTCTGCATGGGGAGGAGTGTGGGTCCTGAG 
TGGAGTGTCCCATCTGTCATGCAGAGAGGGTGATGATATAACCAGCTGGGTG 
TTCGCACCTGAGAGGTTTCGGGGCAGGGTCCCAGCCTCCATGTACGGGGCTG 
GGACCCACCAGGACCCAGACTCCCAGACTAGC AGGTGGGCTTCTCAGAGCAC 

15 AGAGGGTGTGGGTGACCCAGATATCCTGTTGTTGCTCAGAGACCCCCAGGGA 
ATCCTTGAAGATGGCAGTGACCACCTGGCTTCCTCTATAGAAACTCCCCGAAT 
GGGAGGGATGGGAAAGACTGTTGAACTAAGCTTTTTAGGAAGAAAGCCATCA 
ATTCCCCTGCTCTCATACAGGCTTGAGGAGGGCTAAGGAAAGTCACAGTTGA 
ATGCATGCTCCCTCCACACCCATGTGCCTGATGCACACAGCCTGGCCCTTGTG 

20 GGCACTGGAGGGCACCCTACAGTCACAGAGGCAAACGAAGAATGTAGAGCT 
CCAGCTGGATGAGGGCCGTGGGGAAGGGGCACAGCCCTGGGCAAGGGTGTG 
CAGAGTCTGC ATGGGATTTGACTG AGCAATCTGGAAGGCTTCCTGGAGGAGG 
AGCTATTGCTCAGGGTGAAATAGAGGTCTTGAAATTAGTACAACACATGCCC 
CATACTGGGGAATTTGGGGGTCCCTGGGGGAAGGACTCTGTTACCAAAAGCA 

25 CACAGCACAGGTTGGGAGAGCAACTATCTCAAGGCTTTTATTTGCTGTTTCAT 
GAAACTGAAAATTTAAGTTTTTAATATCACATATTATTTCATGAAGAGTGGCT 
TCGGACTTTGGGGTAGGGTCTGGTAGGATCCCAGGGCCCCTTCCCTTGGCATC 
ATCATTGTTTCCCCTCCATGTTCAAGTTCAGCATCTCTGCACCGCTTGCCTGGC 
AAACCCCATCTTCCTGTCCAAAGCCAGACTTGGC?^CCTTTGCTGCAGTGAGT. 

30 GGCCGTTGGCTGGGAGGCTGGGTCTGTGCTGTGTGGGGGGTGTTTGGTGTCCT 
AGGAGGAGTCTCTGTCCTGAGCAGAGACAGAGGAAGGCAGGGCAGCCTGCTC 
AGAGGATGAGCCCAGCTATTCTGGGAAGCAGGCAGGAAGTGGCAGCTCCAG 
GTCCGGGTGTGAGGGCAGTAGTGCAGGTCGTGGTCCCAGATGGCCACGGAGG 
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TGTCAGCGTTTCACTCCCAGCACGTCACGCCACGTCCCTTTCCTCTGCTTTGTT 
CCTCCTCAGCTGTGAACTGTCTGGGACTGGGCCAGGAGACAGTCCTGGGATA 
GGGAAAGAAAAGAGGGGCTTCCGGGGGTGGGGAGCGGGGCTGGGCGGCTGC 
TGGCTGAGCCAAGTGGACTGGGAGGTGTGGACAGGGAACCCTGGCGGGGCC 

5 GGCCAATGGTGGCGTGATGCAGCCTGGCTTGGGAAATGCCCATGGAGGTGAC 
AAGTGACAGCCGGGAACAGACCCACTGCCAGGGCTGCCTCTCTCTTTAGATA 
TGTCCTGGCATCTGACTTTGGTTTCCTGGTGTTTAAAGGCTAATTGGGATCGG 
TCGTGGTGCGTCAGGCCTGTCAATCCAGTACACTGGGAGGCCAAAGTGGGAG 
GATTGCTTGAGCCTAGGAGTTCGAGACCAGCCTGGGCAACATAGCAAGACCC 

1 0 C ATCTCT AAAAAAAAAAATT AAAATATTAATACTAGCTGGGTGTGCTGGCAC 
ATACCTGTGGTCCCAACTACTCGGGAGGCTGAGGTAGGAGCATCACTTGAGC 
CCAGGAGTTCAAGGTTGCAGCAAGCTATGATCTTGCCACTGCACTCCAGCCTG 

agtgacagaacgagaccctgtgtctaaaaaaaaaaaatttttaaataaataa 
ataaaggtttgttggacatgttagctcacgcctgcaatctcagcatttgggga 

15 ggctgaggcaagtggattgcttgaggccaggagtttgagaccagcctgacct 
gcctgggcaacatggtgaaaccccgtctctactaaaaatacaAaaattagct 
gggtgtggtggctcacatctgtagtcccagctaccctggaggctgaggtggg 
aggatcacttgagcccaggagtagaggttgcagtgagctcagattgtgccac 
tgcactccagtctgggtgtcagaaccaggccctttctcaaaggaacaaacaa 

20 ggctaacatgttgggctttctctgcttctgggcaatgtccgtgaaattgaagc 
cgcactggtgttgttccacccttccttccagctggtctctgaggcccaggcca 
gatgcttctctggttagcagcagattctaggccagagcctgcctccaggtcct. 
gggaaggctggcacgctcagatgcactcctgggcgtggccttcctccgatca 
cagtgtgagcgaatctccctgcccatcactctgctcccctggcccagaccagg 

25 ccatgcctcagggcttctgcacgtgcagttttctctgcctgcagggtctgtcc 
ctagctctctacccgattcagatcccacaccagctcctggggcagccctcttg 
ccctgcctctccccgagacgtcaccctttctcttttccccattggtggtggttt 

TATGTGTTCATTTACGGTTTATTTTCTCTCTCCCTGTGTCTGTTATCTGTCTCTG 
CACCCCTGGAGCCCAGCGTGGTGCCAGGCACACACTACGTGCCTCCTGG£jTT 

30 gggggcagcagggacaaaagccaggcatgggatgtggctcttgtgggatgc 
accttgggtgtgactgtcacggaaaaggaggagacgtggggtggggaaggg 
ggcagcccctactgttcctggcagcagggccagaattgagagccagaggttg 
ccactggagggcttcctggggaggcagctggcaccgcctgattcctgtttctg 
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AATGeCCTCTCACTCTCCCCCACCCCCATGGAACCAGAACCACCTCGTCAGGC 
CCAGCAGCCCTTCAGGGTCAGTCTCTGCATGGGTGGTAATATGGCTTACCTGA 
GTCCTGGTCAGTCTGTGGCCTGGGAGTCCACCCTCCATGACTAGTGGTTTGTG 
GCAGGTGACGGGTGAGTGGTCTCCATGCTCCCAGCCGCTGTGTAGAGGTGCC 
5 AACCTGCCCCTCCTGCCAGAGCTGGGCTCCTGTTGGGCGGGACTGAGGATAC 
CGAGACAGAAATAAAACTGCTAAATGCATCTGTTGTTGAATAATTCAATGCC 
CCATGCCAGGCCTGTTGCAAAATTAGCCACTTCCTCCCTCCAGGCCCATGAGT 
GACTCCTGGATTTGGTGAAAGCCAGGCCTAGTGGAATTGTGGCCCGGAAGGT 
TCTCTGGGAGCTGGGAACCTCCATGCAGGAGCCCAGGGGAGGGGGTGCTTGG 
1 0 CC AC ATGCGGGCCCCTCAATCTGCCCTGTGGAGCCCGGCCTGGCCTGCAGCCC 
CCTGTGCAGGGAAAAACCCGCGGTTTTCCCTGGTGGAGGGCACAGGTCAGAG 
CGGGCATGGACAGGGGATGAGGAAGCAGTGCAGAGGAAGCGTGAGCATTTG 
TCTCGGGCCAGCTGGAGAAAGGAAATGGGGCTGCTTCCTGGAGGGCAGAGAC 
ATCAGGGGCTGCGCTGTCGGGCTCGGCGTGCAAGATTTGTGTCTCATCCCAGG 
15 GTTAGGTGCCTCCCAAATGCCAGGGCTTCCCATGGGCCATACACACTCAGTCG 
AACAAGATTTGCACAAAACCTGGGCCATAAGCTCGCTGAGCACTTGGCCTGC 
CCAGCCGGTGACAATGAGAACCGCACATGCTCAGCCTTCCCTGGACTCAGGG 
CCCTACGGGTGCTGGAGCCGGGGCGGGAGAGGGCCGCAGGCTGCCCCTGCCT 
CTGCATCCTCCCCGTCGATGCCTGGGGCCACTGGGATTGGCAGAAAAGGGAT 
20 GCCCCCGAGATGGGCAGCCAGGAGAACTGTCTGTCTCCTATCCCAATATGCCT 
ACCTCCCTCTCTGAATCCCCATTTCCCCCTCATCCTTGCCCCCACACACCGAGA 
TTTGCAATGGGACACACGGCCACCTGGGATGACAACCCCTCCCAGTCTTCCCT 
CAAGACAAGTGTGGCCATGACCGCATGGGGCCAGCAGGGTGTGAGTTGAAGT 
GTCCCGCAGTGGCTTCCGGGGATGTGCAGCCCTGCCCTGAGGCCCTCTGGCCG 
25 CCTCTTGCTCTCTGAGCAGGGATGTCCGTGCCTTGGGCCTGAGGCTGAGCTGG 
GCTAGGTCCCTGGGTGTCTGGGGCCACACTCTCTGGTCCAGCCTGCTGTTACA 
GGGGATGCCTCAGGGCCACCACTGGTCATGGAGGCTGGGGGCGCCTGCCTGT 
CTGCCCGCCTGGAGTTCAGGTAAACAACCCCACTGAGGAGGCACCCCTAGCC 
GTGGGAGTTG^tCGTGTCCTGCTCCCCAGGGCGGTCTGTCCCCTTGGCGAGATC 
30 CTTGCTGTGTCAGCAGAGGGTTTTCCTGGCCGAGGGTCTTGGGGAGCAGATGA 

tggggctttaggagctctggctctgggaaggcagtggaccctggagcaggga 
tcttgggaggatggagagttggcggctcaggaaggaggctgctcaagccccg 
gcctgctcactgggaagagcggtccttggaggagagagctcccgaggagaaa 
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GCCATGATGCTGTGCTGCGCTCTGGGCCACCAGGTCTTGTCTGGGGGGAATCC 
TGACCCTGTGTGATGTGAGTGCAGGGACTCCCCCACCCTCCGACCAGGACCTT 
TGTACTCTCCTGGCCGTGTCATTGCACAACTCCAGGGGCCGTCACTCACCTGA 
GCCCTCTTTGCTCTTCTTGGCTGTTCCCTGATATGGCCACGCAGAGACTATCTC 
5 TAAGTGTCAGTTGTGGACTCAGCTGGAAAGGTTTCATGACCTCAAGGGACCC 
ATGGGACTTGAGGGTGGGGCAGCTGTTTGCCCTGTGGATCAAAGGGGAGTGT 
CTGGGATGGGATCCACCTGCCCTGCCCTCAAAGTGCTGGGCAGTCCCTCAGA 
GCCATAGAGCCTAGGGGGTCTTGGGCCCTCTTCCCCAGGGCATGGCCCAGGC 
ACCTGCTGGCCTGAGGTCCCTGCTGGGCCCAAGGCATCCCTGCCGTGTTCAGA 

1 0 GTTCCACAGGCACCAGTCCACCATGCTGGGGCCTACTGGGAGAGACAGCCAC 
CCGCCTGCTGTCGGGAGCCTCCATCACCAGAGCCTGGGGGTGTCCCAGCCAC 
CCTGGCCGCTGTTTCTCTGCCTCCGCACCGGCCTCTCTCCTGAGTTCCATCCAT 
CCCTGCAGGTTGGGTTCTCCTCTCTCCTCTGGGATCTGCACAGGTTGCCATTCT 
CTCTGCCTAGACTGGGCCTCCACCTCTGAGAAGCCCTCCCTTATTGCCCCTTCC 

15 AGTGGGCTGGGAGAGGGAGGAAGGCCAGGTGCAGAAAGAGAATTCACCTGC 
CCGGCTAAAGCCCCACCTGCCTATTTCAAGGCCTCTGATGCCCCCCGGCCTGC 
AAGGTGTGTGCATGGGGCCTGTCCCATGCTGGCTCACCCAGCTGCTCAGAGA 
GGCAAGTGGCTCCCGGGCCCCACAGTGACACTGGGGGACTCTTTGTCTCCATT 
TGGGGGGTGTGCAGGTCACTCCTGGTGCCCTGGTCAGCTTCCCCAACCTGAGC 

20 TGCCCTCAGCTCCTGGCTTGGCCCTCCACGGAGTGGGAGATGCCCCCCTCCCC 
TGCCATTATCGTACCTTACCTCGCTGGCGGCCATGGAGGCAGTGAGAAGAGC 
CGAGGGAGCTGGCCGGGGCGCATTTCCGTGCAGGACACTGGCTGCCTCCCAG 
CTCACTCTTAGCCTGGTCATAGAGCCCGGCAGCATCTTGCCAGGTGCCTGTGG 
TCCCAGATCCTGGGGCTGCCCCATCTCTGGAATATTGAGGTGTGGCTGGGGCT 

25 GATCTGAGGATGAATGTGGCTCAGGTTGCCGGGTGCCCTTGACGCATTCCCAT 
GTCCCAGTGGGGAAACCAAGGCTCAAGGAGGCCACACAGCAGCTCCATTTCC 
AGGGACACCCAGTGTTCTCTGTAGTGGGTAGGAGGTTTCCCTATACAGGAGA 
CCAAGCTACAGCAAGAAAGACTTGGCCGGGCGAGGTGGCTCATGGGCGCGGT 
GGCTCACGCCT^fTAATCCCAGCACTTTGGGAGGCCAAGGGGATGACGAGGTC 

30 AGGAGATCAAGGCCATCCTGGCTAACACTGTGAAACCCCATCTCTACTAAAA 
ATACAAAAAATTAGCCCGGCGTGGTGGCGGGCGCCTGTAGTCCCAGCTACTC 
GGGAGGCTGAGGCAGGAGAATGGCATGAACCCGGGAGGCAGAGCTTGCAGT 
GAGCCGAGATTGCGCCACTGCACTCCAGCCTGGGCGACAGAGCCAGACTCCG 
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TCTCGAAAAGAGACTTGGGCTAGATGTGAGGAGGGACTTTCCCAAGTAGGGA 
TGAACCCTGAGGACCCACACCCTGAGCCGCCAGTGGGTCGTGTTGACCTGGC 
AGCTCCAGGGCACCTGAGGCCTTCGCTGTCCTTCCGCACCTCGCCCCCTTCCC 
GAGTGGAGGTGCTCTGGGAGCTCTGCCTGCTTCCTCAAGTCTTTCTTGTTTTTT 

5 TATGTTATTTTATTTTAGATTTGAGGAGTACACATACTTGTCTGTTACGTGCGT 
AATGGTGGGGTTTGGGCTTCTCGTGCGCCCATCAGCCGGATATTGGACGTTAT 
ACTCAATAGGTAATTTTTCAACCCTCCCCCTCTCACCCTCTCCACCTCCCCTTT 
GGAGTCCCAGAGTGACCTTTCTCCGTCTTTCTGTCCAAGCATCCCATCTGTCTA 
GCTTCCACTCATGAGTGAGAACGTGCCGTGTTTGGTTTTCTGTTTCCGGTAGTT 

10 CACTTAGGATCATGGTCTCCAGATCCATCCGTGTTGCTGCAAAGGACATGATT ; 
TCATTCACTTCTTGGAGGCTTTAACGCCTGCCCTGTGTGAGGGAGGGCGGAGC 
CCCTGGCCCAGGGGGGGGGAGGCCTGTCCACCCCTCCCACCCGTCCCCGAGG 
CCCCTGCTGCCCAGCAAGACCACCACCCTCTGCTCCAGGACAGCGCAGCCGC 
CATCAGTGGGGGTGAGGGGACAAGGCTGCTGGGACCAGGCCTCGCCGAGGCA 

15 GAGCCTCGCCATTTCTGGGGTGGGGGGTGCACAGGTGCATCTGGTCTGCCCA 
GCCCCTCCTCACCCTAGGCAGGCCTCTCACAGAGACAAGACGAGTGTCCTAG 
GGCAGATAGGGCCACATCCATGGGGTCACTTCAGTTTGTCCTCCCGGCGCCTT 
CTAAATATACTCTCTGTGGGTGTCTTATCACAGCCCAGGGAGCCAGGCTTGGT 
GTGGCCGGCCCTGGGAGCACAGAGCCCCGTTCCCAGCAGCCCTCGGCCCTGG 

20 GGGGCTTGTGCTCTGCCGGGCCACCTGCACCCTCCGGGCCCCAGGAGAAGCT 
CATGCCTTTCCTGCTTGCACCCCAGAGCCTGGCCACCCCTGTCCTCTGCTGAG 
GCCCTGACCACCTCTATGTGTGCCATGGCTGCAGTGACAGCCCAGACCCCAG 
GGGCCGGGCATCCCACAGCTCTCCTCACTGAGCCCCTGGGCCTATCCGCCCAC 
GCCCAGGCCAGGGCTCCTGCTGGAATCTCCCTGTGGAAGGCCAGGCTGCCTG 

25 GATGCCCACCACTCCAGGCTTACTTCTGGGTTGTAGTTTTCGGGGTGCTTCGA 
GGTGCTGTGGGGACCTGTGATGCTGTCAGGGAGACCCGGAGTCCAGTTCTCG 
CCAACCGCTCGATGCTGTGTGACTTGAGGCAGGTCCCTTAACCTCTCTGAGCT 
TCCTCCTCTCCTGTGAACAAAGGATGGGTCCACTTCTCAGGCTTCTCATGAAG 
•TTTAGAGGCGACAGGAGGAGGTGG^CTTTAAATGGCTCAGTCCCAGCGTCCC 

30 CTCATGCCTGGGACCGCTGACCCACCGGGATTCTTGGGGGCCACCATAGAGC 
CTGTTTTTAAAGGGGATGTGGCGGCCAGCTCTGGGGGCACACGTTCCTGTTTG 
AATTCTCAATCTCCGGGCTCAAAATTGGCCCCCAGAAAGCCTTTGAGATGGAC 
AGGGAGCTGGACCTTCCCAATTCGGTTGCCCTTGCTTGATGTTTACATGCCAC 
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TGTTGGGTGCTCCTGATTGTGTAGCGGGTCCCCTGAAACCAACACACAGGTGC 
AGACACCCACATGCCATCCTGCCAGAGCCCTGGGTTCTCAGCAGGACTCTTTG 
CCGGGATCCTCATGCTACGGCCACCACCTGAT1TTGTCGATAAAGTTTTATTA 
GGACACAGCCATGCCCGGCCGAATATTGCCACAGAGGCCACCTGCCCCACCA 

5 CGCCTAGGACATTTCCTCTCTGGCCCAGAGGTGTGCCAGGGCTGCACTGGGCT 
CTTCTGGTGGGGAGGGACATTGTTCTGGCCCCTCCTGATCTCTCCCCCACCTT 
GCCACGACCCGCCAGAGTTTGCCTGGTGTCACCGGGAAGCTTCCTGGCTGCCC 
GAGGGTCCCCCAGCACCTGTCCCTCTAGGGCCTCTGAAGGGCTTTGACGTGAC 
ACTCGGGATGGCATGGCAGGGCCCTCAAGCACGGCTGTGTTCACCCCCCAGG 

10 CTGCTCCTGGAAAGGAGGGATCAGGGCCCAGGCTCCACGCTCCGCACTGAGG 
TGTGGTCGAGGGCCCTGGGCGTCCGGGTGAGATGGGAATGAGCCCTTGGTAG 
GTGCCAGGCGGGGCCAGCCCTCATCCCCACTGCAAGGCCTGAGCGTCCAGGA 
GCCCCGTGCGCAGCTGGACCCATCAGTCCCCACCCTGCAGGGCCGTTGAAGA 
GGGAATGCACACCTGTCACCTCTTGGTGCGGGGGCATCCAGGTTCCTGGCGAT 

15 GGGGATGGGGTGAGATGCACCTGATGGACCTCCAGGTGGGAAGCTCCTGGGA 
CACCCTGGAGGGCTCTGGTGGGAGACATGGCATTCTGTCTGCAGCGAAGGGG 
GCTCAGGGCTGAGAAGCCCTGGGACGGGTTGGATTTGGGGCGCGTTCCCCAG 
GGCCCCACGCGTGTTGAGTCCCCGCTGCGTCACTGTCTGGCCTGTGGCGCCTC 
ACCTCTCTGTCTGGCAGGACTCAGGCAGAATGGAAGTAAAGGCAGGACCTTG 

20 TCCCAGGACATCCAGGCATGTGGCGTGGCAACCAGGTGGGATGGCAGGGTTT 
CCCCCTGGCAGGATCCCCACCTCCTGCACCTGTCCTGACTTGGTCTTCTGGGG 
CAGCCCCTAAGACGCTTCTTCAGGGAAGAGGGACCGGTTTGGCCAGTGGGAC 
CATGGCCCTGGGAGCATATGCCTGCACGTCCTGCCCGCTCACACCTGCCTGCT 
GGGGGGGCCATGCCTGCGCCCGCAGCTCACGTCAGAGCCCACCCCAGGAGGC 

25 TCCCAGCGATCTGTCAACCTGTGCTGTTGGCATGCAGCCTGGGGGTGAGGGGT 
GGCGGAACGCTGCCTCAGTACCACGTCCTGCTCAAAGGCCACTTGCGAGCTA 
GGCTGAGCCTGTGGTGTGCGGTGGGATAGGAAGGAAAAGAACAGAGTCTAA 
GGAGCCTAACTCACTTCCAGATTCACCCGGAGGCCTGCATGGAGGAGGCGGC 
ACTGAGACTTCATTCACCGAGCTTGAGGGTTTACCCTAACA^rCATTCACACC 

30 ATCAGTCATTCAGCAAACACTTCACAAACGTGCATCCCACGCCTGGCACTGG 
GCCTTCGGAGGGAGGGAATGCAGCTGGTGGGCCCCTGCCTGGGGAGCCCCTG 
CTCTGGTCGGGGAGGCAGTGGTGGCAGTGGCTGCCTGGCGGTCCATGAACTG 
GTGCAAGCCAGAGCCGGAGGCCTCCTGAGAGCCTGGCAGGGAAGCCCAGGC 
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CAGGGCCTCCGCAGAAGGAGGGTTTGGGGTCCAGGCTTCATGTGGGGGGTCT 
TGTAACATGGGAGGGTAGACGCTGCCTTGGGACAGGAGAGACAGAGGCGGG 
GGGACCAGGGGAGTGGGCTGCAGTTGGGGTGCAGGCCTGGTGGCCCCGGCAG 
GGGGACCAGCCTCTGGGGAGGTGCTGCAGGCACAGGAGTTCTGTGTGGACCC 

5 AGCCGAAAGCTGGCAGGAAGGGACCTGTTTCTTGGGGTTCCCAGTCCAGATC 
TGTGTTGGAAAAAGTGTCAGCTGAAACCCCAAGAGGCTGGGGACAGCCTTGG 
TCCTGCTGTGGGACACAGGTTTAATCTCCAGATGGCTCCTGGATAGGCGGGCT 
GGCGGCTCTGTCTAACGGGGACAGCACACTTGGGGGTTCGGGAGACAATGAG 
GGCAGAAATCTTACCCACCGGCCACACCAAGTCACCCATTGTTGTGTCCCTTG 
10 GGCTGGGATCCCATGGCTTTCAAGAACTTTTTAAAGAAATGAGACTCAGCTGC 
CGCGGAACAAGCCACAGGTAAAGAGTATAAGACAGCGTGAGGCATGGGGTG 
GCCACGGGAGCCCTGGGCCAAGGAGGCCCAGTCTGGTGGGGAGGCGTCCATT 
CACCCGGAGCTGGACGGGGAGGACCAGGCTGTGGGTCTGTGAAGCGGGTGTG 
GGATCACATCCACGCTCCTTCTGTGGTTTGTGCCCTGTTGTGGCTGCCAGGGG 

15 AGGGGCTTCCAAGGCCCTGCCAACCAGCTGGGCAGTGTGGACAATGGCAGAC 
AGGGTGCTGTGTGGGGGCCTGAGGGGCAATGGTTGGGGGGCAGCTGTTGCCG 
TTCTGGTGAGAAGTGGCGGTCCCTGAAGGGCTGGGGTGCTAGGGAGCCTCAT 
GGACCTACTCCCGACGTGGCAGCTGCCTGCTGCAGATGGGCCAGCCATGTCC 
CGCCTGTGCCGGGAATCACCACCCCCAGCTCCCTTCTCTCCAACATGACAGGG 

20 TTCGGGTGATGCCCTGTGGGCTGAATGGTGAGAGAGGCGTCCCTGTGCTGAC 
ACCAGTGGGCGTGGACGTGACCCTCTTTTTAAAAGAATTTTTTCAGATGCAAT 
TCCATTAAGGATCTGGAGGTGAGAGGATGAAAAGGAAACTCTGAGCTTTGAG 
TACATTTTTAATTATGTTTTAAAAATTAAGACATGAGGCCGGGCATGGTGGCT 
TACGCCTGTAATCCCAGCACTTTGGGAGACTGAGTTGGGAGGATCGCTTGAG 

25 CCCAGGAAATCGAGGCTGCAGTGAGCCAGGATCATACCACTGTCCTCCAGCC 
TGGGTGACAGAACAAGACTCrGTCTCAAAAACAAAACAAAACAAAACAAAA 
AACCAAAAAACAACAACAACTAATATATGTTTTTGACCAATACCTAATTCTGT 
TGTTTAAGAATTCAAAATCTACCAATAGTGATCATAAGTAATGTATTAAATAC 
lT/ s CATAGTACAGCAGGTGTTGGCCCCCCCCCACAACATTGGTTTCCATCCCT 

30 GGAAGCTGCAAATGTGACCTGACATAGAAAACACCCCTTTGCAGCTGAGGCT 
GAGGGTCCCGAGAGGCGTGATTGCATTATCCAGGGGGCCCAAGTCCAGTGAC 
ACGTGTCCTCCTAAAAGACAGGACAGATTCAACAAAGGAGGATCCGGCCACA 
TGCTGCACAGCACGGGAGTGATGCCGCCACCCACCCAGGACGCGGGAGCCCG 
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CAGAGGCTGGAAGGGTGAAAAGGGCCCTGTCGCCGCCACCCCCCAGGATGAC 

GTATGGCCTGATGTGGCATTCCTGCCTCAGGACCATGGCAGCAAGTGGTGGC 

GCTGGGGCCCCTGGAGTCGGCCAGTCTGGGCACGTCCCATGCTGTGTCTGAGC 

CTCAGTGTCCCCATCGATTGGTAGTGCCTCCTGAACCCCAATCCAGGGGGCTT 

CTGGGGGGCAGCCTGGGACCAGACGTGCCTCACCAGGGCTTCTGTCTCCTCAC 

CAGGACTTCCATGATGTGACACCTTCCACATGGGGTTCAGAGCGGGGAACCT 

GTGCCTAGTGTCCAGCCCAGCGTGGGGCCCTCGGTGTGGGCCAGGGCTCACC 

CCTTTGGGAGGGGTCGGGAGATGCCTGGCCCAGGCTCCATCCATGCAGAGCC 

AGGAAGCAGGTGAACCTGTGGGGACCTTGGCGACCTCCAAACCCTGCTGAGG 

CTGCCTGCTGACAGCCACTGGTGAGGGGGGTGACAGCCACTGGTGAGGGGGG 

TGACCAGCTGGTAGCTTTTTCTGAGGTGCAGATTCAGCCCTGGGGAACAGCCC 

TGGCAGGCTGTGAGACCTCTGCCCCTGGGCGTCCACATCCCAGGGCTCTGAGC 

TCAGCAGCCCTGGAGCCGGCAGCCACATCAGGGATGGAGGGAGACCTGCTGG 

TGACCAGTGAGTGAGCAGGGGAAGCCTGTCCTCTCCAGCGCCCAGTGTTTCCT 

TCCTGGGGCACATGGGATTTGGAGGCATTTGGAGTCTCCCAGGATGACTTTTT 

TTTTTAAGCCCCTAGAAGTGGTTTGGGGTTTAAGCGTCAATATTTAATCTCTTG 

AAGTTACACATTCAAGGCCCTATCCTGCCTTCGGCTTTAGGTTCGCTTCACAA 

ACCCATTGTCTGCTTATCAGCCTGGCGAGGTCCCGTCCCCAGTAGGGGTCTTT 

GAACATTGCTCGATCTCATTCACCAGCCACGGGGAACATTTTGAACCCTCTGA 

TGCITGCTCTATGAATGTTGTTTGCTTTAAAAAAAGTTTTTi 1 1 11ATTGTGAT 

AAAATACACATAACACACAATTTACCGTCTTAACTCTGTGTGAGGTCACGGCC 

CAGTGGCAATAGGTACATTTACATGGTTGTGCAACCGTCACCACCATCCACCC 

CAAAACTTCTCTCCCAAACTGAAATTCTQCACCCATGAAATCCCACTCCCTCT 

CCCCGGCCCCGCCATCCTGCTCTCCGCAACCGTGGATCCGTCCTCCCGATGTC 

CACAGATCCCCTCTCCCTCGGCCCCGCTGTCCGCTTCCCTGCGACGGTGGATT 

TCGCTGCTCTGGGCCTTTGCGCTTCTGTGACTGGCACTGTGTGTTATCCGGTGT 

CCTCAAGGTGCCTCTCTGCTCATGTCCTTCCATGCATGATGCTTTCAGACTGAC 

AAAACTCTTCAAGTATTAACGTAATTTTTAATGTTTACAGATTCAAGAAATTA 

AGCATACGGTCACGAGA^.tTGCAAAGTTCTGTGAAACGCTCCAGTGGTTACAC 

gccccggggtttcagcttcgaccctgggtgagttccatggtgcagtggcttcc 
agctggagttctaagggcaccccttgagctgcccacgtgtctttgcacgtgaa 
gtaggtggctctctttaaggttcctcaactttattgggaaacatctcagggtt 

ACAAAACGCCCCCCATGGGCTAAAGAGGCTGTCACTCACTGTGTGTGGGGCC 
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TGTCCCCGAGTGGGCATGGAAGGCCAGAGTCCCGGCACAGCCCGCACGCGGA 
CTGGATTTGCTTGGCGGGTCACCTGGATCAACCAGGCCCATTTAGGAGCACCA 
CTCAGGGGCCCTAGTGTGTGCGGCAGGGCAGGTCATGGAGTCTCCAGCCTTC 
AGCTCGCGTTAGCCCCAGGCCATCTGAGCCGTGGAGCCCACCCAGGCCCGCT 
5 GCCTGCACCCCGGGGGAGCCCTCCCACAGCGTCTCGGGCTATGACCCTGGCA 
GGGTTCCTGCATTCTCCTGTGTTCCCGGGCACCCGGGGCTGAGATGTTGAAAT 
GATGTCGGATAATTCACCGGGTGCCTTCGGGGCTGGTACTCGCCGTCACTGCT 
CCTGAGGCCTATTTGCTGTCAAGGGGAACAAGTAGGAGGTTGTTTTCTGGGGT 
GCGGTGGGCTCCCCGCTCATGCAGCTCCCCGCCCCCCGCATGACCACGCCCTC 
10 TTGGGAGCTCGGGTGGGACAGCATTCAGGGGACCACATGAGCTGGGGGGGG 
GTCGGTGTCTTCTCTCCCCCACCeCTTCGCTAGCCTGTCCTGCTTCCCACTCCG 
AGTCCAAAACACGCATAGGATTCGTCCTCTCCCAGGGCCTTAGTTTCCCGGCT 
CTGCAGAGTGGGGGGCTGTGACTGCCTGGCCTTGGAGCTGAGGCAGAGGGGG 
TGGTAGGTGGCCCTGCAGAGGCAGGAGGGGACACAGGCACCCCAGGCCCTG 
15 AGTGCCTCAAGGGCAGGGACCCCAGCTCGGCACCTGTGAGGTTCCCAGCTCC 
GCCCGAGACAAGAATAGATAAAGCGGTAAATAAGATAAAACTCTTTTTATCT 
GGAGTAGATAAAAGTAAGCTTTGTCATATAATTCCTTCTGAGGCTAGCACGGT 
ATATCAAAATATCGGAAGAATCATTTAGCCTTCAGAATATGGCAAGGCTGAC 
AGCCAGCCTGGCCCATGGGCACCTGTCCCCACCATCTGCCTGGCCTGCGGGG 
20 GAACGTTCCTCAGCCCCTGCTGGCTCCCAACCCCCGGGCCTGGAGCCCTGTCT 
TCTGCCTCAGCCGCGATTCCACTGCACGGGACTTTCCCAGGCACCTCTGCCCA 
GGTAGTTGGGGTGGACGCCCACTTGGGTGCACACTCTGTGGGGGCTGGTCCA 
GGACCACCCATGTGCAGTCTTCCCTGGCCACATCCTGTATTCTCCCGCAACGC 
CCAGTCAAGAATCTTCCTGCTGCAGTAACTCCACATATCTGAGCCAGCTGTGT 
25 CTCCTGAGCACAGGACATGGGCAGGAGAGGCCCCTGCAGCTGGAATCCGCGC 
CATGAAGCCCAGGCGCGTGGTATTTGCAGCCAGTCACCAAGTTTTCAAGCCCT 
CACCTGCGGCCTGTGAGTGCCCACCCTGCACATGGTGCTAACCAGCATCCATC 
ACTGCTTTCTTTTTTAATTTAATTTAATT'l"ri"l'l"ri if ITGTGGTGGAGTCTTGC 
TCTGTC ACCC AGGCTGG AGTGC AGTGGCGCGA Z CTCGGCTC ACTGC AACCTCC 
30 ACCrCCCTGGTTCAAGCAATCCCCCTGCCTCAATCTCCTGAGTGGCTGGGATT 
ACAGGAGCACACCACCACCCACGCCCAGTTAATTTTTTCATATTTTTAGTAGA 
* GACGGGGTTTCACCATGTTGGCCAGACTGGTCTCGAACTCCTGTCCTCAGGCA 
ATCCTCCCGCCTCGGCCTCCCAAAGTGCTGGGATTGCAGGTGTGAGCCGCCGC 
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GCCCGGCCCCACCACTGCTTTCTATGCTGTGGCCGAGGTCACCAACTGCCACT 

GTGATGGACGGCTGTCACCCAAACAGCACGAGAATCGAAAAGCCGTGATGGC 

AGATGCAGAAGGCAGCGTCGCCTGTGGTCTTTTAGAAGGGCAGCTGTGAAGT 

GGGGTTATCTCAGCTTCCACATTGCATCACGGTGCCCCTGTGAAGGGCAGAG 

GAGCTGCAGGTAATTCTGTTCGCTCGCCTCTGCCTTGGAGGAAGTTTACAGTT 

TTTTTTTTTTTTTTTTTTGAAAGTTTAAAGCTTTC 

CAGGGTCAGCAGACCTGGGTCCTGCCCATGGCCCCCTTCCCTCCTCAGGCCTC 

CATTTGCTCATCTGAGAGTGGGGGGAGTGCAGCGCCTTCCGTGCAGGTCACTG 

AGAGGCTGGGGCAGTGGCGTGGATTCGGGTGCATGACTCGCAGACATGGCTG 

TCGCTTGGCTGAACGGTGGCCCCTGAGGTGTCTGTGTCCTTGTCCTGTGAACA 

AGTTAGGTTGTGTGGCCAAAGGGGCTTTGCAGGGTGACTGAGGGATGGCTCG 

TGAGAGGGGAGGTTGTCCTGGGTTACCCGGGTGAGCCCTCACTGTGATTGTG 

AGGATTCTTCTAAGAGAAGCAGGAGGATCTGAGTTGGAGGAAGAGACGGGA 

GGATGGAAGCCAGGCTCGGCGACGGGAAAGCTGCTGTGCCACCCTCACCGCA 

GCCCTATCCCTGCTCACGTGCACCAAGACCAGAGCCTGGCAAGTCGTCTCATA 

CCGAGGGTCAAAGGGGCCCAGGGCCCAGCCTCCAGGGACCCCAGCCATGCCG 

GGTCAGGGAACCAGGCTGCAGTCCTGTGTCTCCTCCTCAGTTTACCCCAATTG 

TGACCTCAGAAGCCCnTGGAGCCTCAGTTTCCCTACTTGTAGCATGTGTTTGA 

TGATAGTGTGGCAGTGATGTCACAGTCTGGCAGGGTCTCGTGGGGGTCTGTGT 

TACTGTCTGCAAAGCACTGGCTGGGCCAGTGGTCAGTGAGCAGGGGCGTGGG 

TAGCGGGCAGTGGCCTGCAGTCCCCAGGAGCACTTCCTGTTGCTGAACAGAC 

AGCAACTTAGTGGATCCTTCCCAGGGCCCCTGGTGGGTGGTGGGTTTTGCCAC 

ATCTTTGCAGTAGACAGCACTGGCAGCTTCCGGGTTATCCAAGGCCTCAGGGT 

GAAGTCGACGCTCGCTGCTGATTCCCTGCTGGACGTCAGGTGACACTTCCCTC 

CACAGCCTGGGAAGAAGCACCTGTGGGCCACTTCTTTGTGTTGATAAGGAAA 

CTGAGGCAGGGGAGCTGCCTGGGTCACCAGTGGGCACTGCATCTCCATGTGC 

AGATGGTGCCCTGGCCCTCTCTGCACGCCGTGGGCCTGGGGAGCAGGCAGGT 

GGCATCTTCTGCCCCGGGCAGCCCGGCTCTGCTTCCTCGTGGCTGAGCCAGGC 

TGTGTCT AATCTC ACTTCCCTGCTC ACTGC AAATC AAAACAAACAGA i 'GTCC A 

GGAGGATTAAGTGTGGCAGTCATGGCGGAGGGAGCCCTGGTAGGGTGGAGA 

GAGCCAACAGTTGCACCAGCAAGGGGCAGAGACCCAGCCCAGCCCTGGGGG 

CTGCAGGGTGTGGGGACAGGGCCTGCAGCTTGGCACAAAGACTCTTCTAAGA 

ACCTGGAGGGAACCCTGCTCCGGCAGTGGGCAGCATGCTGTGTGCAAGGTGA 
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GCCAGTCTGAGCCCATTCCCAGTGCCCGGAATGGTGCCCAGTGCAGATGGGC 
. ACTCACCCTTCCAACCAACCTTTGCTTGTGTCCCCGTGAGATCCACAGATTCA 
CCCTACGGAGGGCTCCCCACAGGGTTGGCCCGGAGGCTCCAGCTGTGTCTGTT 
GAACAAATTACTCTGCATCCACCGCGTGCTAGCGCTGTGCGGATTATAGTCCC 
5 AGAGACAGATGAGAGCAGGGGTGGTCAGTGTAGGAGCACCCAGAGGGAGCC 
TGGCAGGGCCTGGACATCCTGTCCAAGCTGGCCAGGAGGAGTCCTCAGAGCC 
AAAGCAGTGTGGAGGGTGGGGGTGCAGAGAGCGTCCCTGGCTGGAAGCACC 
GCGTTTGCAAATGCCAGGCAGCTTATGCAGCCAGCGCTTTGGGCAGGCGGGA 
GTCAGGGGGACTGAGGCCCAAGGAGGGGTAGGGCTGCATGCCAGCCTTCCAG 
10 GTGCTCCTGGGAACCACGTGGCTGATCAGACTTGTGCTTTGGGTCGAATGTCC 
CAGTGGGGGAGTAACCTCAGGGAGCAGAGGAAGGGTCATGGTGGTGGGGAT 
GAAGCAGCAGCATTGGGGCTTAGAGATTAAATGAAGGAAGTCCCACGGGGCT 
GATCACCTACTGTGCACCAGGCTGCGTCCCGGGCATGGACTGCAGCCCACAG 
CCCCACTCTTGGGGTGCTCATGGTCTAGGGGGTGGACAACAGACTCTGGTCCC 
15 AGCGATGATGCTTGAGTATAAAGGACACAGTTGGGGTGTTGTGGGGGCCAGG 
GTCAGACATCAGAGCTGGCCTCTGCCTATGAGACAGGGCCCGCTGGCAGGAC 
CCTCAAGCTGGGCTTTTAGGCTGGGCAGCAGCAGGTGCACGGGCCCTGAGGT 
GGGCCGGGCTGGGCACAGTCAGGACTGGCAACAAGGGCCGGGAGGAGCTGG 
GGTGATCTGGGGCCATCGTGGTGTAAATACTGGGGCACGACATGGAGGGTTC 
20 ACTGACTGGCTGGGTGTGTGGGCTGGGGGCGCCGAGGGTGGAGGTGGGCACT 
GTGGTCAGGGGGAGATGAGGATGGATGGGCAGGGTGCAGACAGGGGCTGGC 
TGGGTTCGGCTCTGCAGGTAGAGCTGACCGGTGTGGCAGGGACAGGGTGTCG 
GTGAGGCAGGTGGCTGAGGAAATGGTAGCTGCAGGTTTTGGCCCCAGTCCCA 
GCAGCGGGACTATGGCTGTGGGGAGGGAGGAACCCAGGGAGGGGGCCTGAG 
25 CAGGGTCTTTGGCTGAGTGAGCCCTCAAGGGCCTTTGGGTAGGGCTACCAGA 
GGCCAGGAAAGGATTGTCCTGTCCACAGGAGCTTGGGGAAGGGGGTGGGGG 
GCCAGAGCCCCACCTTCCCCAGAAACCTCCGTGCAGCAGGAGGATGAGCCGA 
GGCAGGCTGTGCTGGGAGGTAGCCTGTTTGTTGTGGGCTCTGGGGGCCTGGG 
GAGGGTGGi'iCTGGGCTGTCCACATCTGGACAGGTGGAAGAGGATGGGTTCTG 
30 GGTGGGGCTGACTTCAGGAGGCTGGATGAGAGGCAATGCAGGGGGCCTTCTG 
GCTGTCCTGGGGTGGGTTAGAAGCCACTCAGGATGGCAGGATCTCAGGGAGC 
GGAGGGAGCCTGGGGCACCCTATGCCAGGCAGAGGATCTATGCCGGGGCCCC 
CAGGGTCTGTGTGTGGCTTTCTCTGTTTGTGTCTCCCCTGGCCCTGGGGGCCAT 
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GTGGCCATCTCCTGGCTCTCCGGTCGCTGAGGATCCACGGGGGTCCCTGGAGT 

CTCCGGAGCCCGTGGGAATCCCCCTAGAGGCGGGAGGAAGGCCGGGCGGGC 

CCTGGAGGGTACGTTTGCAGTGGCAGCAGCTCCCTATCTTTATGGTTCCGGCG 

ATCAGAGAAGGCGGCCCGTCTTTTCTCTGCGGGCCCACAGGCCCAGAACGTT 

GGCCCCAGGGTGGGGCTCTGTGCGCCTTCCCTTTCATGAGTTCTGTGGCTCGG 

CCTCGGGGCCTTGGAAAGGAGGAAGTGAAACTGGAGGAGAGCGGCCCACAG 

CCTGCCGCCTGGAGGCTCCTCTGGGAAGCAGACGCTTGGTGCAGGTGTCAGG 

ACCCCCACCTGGCGGGCTGGCTTCTGTGCTGTGACCACCAACACCCCTGCCCT 

GGGCCCTGAACTCCTGGACAGGGGGCGTTGACTGCTCTCTGGAAGTGATTTGT 

AAAATCAGCTTTGTAGAGACGTAATTACACACTGCAGTTCATTTGTGGCTCTA 

GGGCCCTAAGTGGCTCTAGAATGTTCAGAGTTGTGTAGCCACCACCACCATCC 

ACATCCAGAACGTTTAATTACCCCAAAAGAAAACCCCACGCCCACAGCAGTC 

CCGCCCTGGGGCAGCCACCAGCCTGCTTTCTGTTTCTGTGGCTCTGGCTGTCC 

AGGGCGTAGCCACACTGCAGGTGGTCAGCAGGTGGTCACCTGTCTGTGTCTG 

GCCTTGGCGCCATGTTTCTCGGACCCATCCACGTGGCAGCATGTGTTGGTGCT 

TCGTCCCTTCCCGTGGCGGGACGGCATCCCTCGTGCGGACCTACCTCTAGTGT 

GTGCITACCAGCTAATGGATGACTGGGTTTTCGAAGCACTGTCTGTTCCTACC 

TGGGGGCGGGGCTGAGGCCAGGGGCCCCTCCACTCCCCAGGTGCATCTGTGG 

GATGGGCAGAGGCCGTGATGCTGACTGCCGTGTCCCTGTCTTGCAGCTTCCTC 

ATCGTCCTGGTCTGCCTCATCTTCAGCGTGCTGTCCACCATCGAGCAGTATGC 

CGCCCTGGCCACGGGGACTCTCTTCTGGATGGTACGTAGCATCTGAGGGCATG 

GCTGGATGTCATGGCTGCCTTGGAAGCTGGCATCTCCCTGGCGCTGGGCCCCA 

TAAGGTGGGGGGCAGAGCCACTCCCAGCCCCTTGCCCACACATTGGTCCTGC 

CCTGATACAGGGGGCACCTCCCCAGCCCCCACACTTGCCAAGTGACTTGGGA 

TGTATGTGCCACAGGCGAGGGGGACCAAGGACTGAGGGGAAGCTGGAGCTG 

GTGGTCTGAAAGGGCTTCCTGGAGGAGGGGCTTCCAGGCTGGGCTGGAAGGA 

AGGAAGGAGGAGGTAGGGTAGAGCTGGTGTGGCCAGCAAAGGCCTGCCCGA 

GCCTTCGTGTCCCTGGGGTCTGGGTGACTCTGTTCCTGGGTTATGGCTGGCAC 

ACAGGCCCTGCTGGTGTCCCAGC/iGCAGCCCCCAGCCCAGGCACCTGGATGG 

TCCCTTCCATTGGCCTGAGCGGTGTCTGTGCCATTTCTGCCAGGCAGGTGCTG 

AGAGCAGGGCAGGGCAGGCTCATAGCCTGGTGGGAAACCCCAAAGTTGATCT 

GGCCGTGAGGCTGCCTGGAAGGTGGGGTGCAGGGGGCCAGCTGTGTCTGGAG 

CCCACGGGGTCCCCACGGAAGCCACCTGAATGTGGGCCGTGAGCAGCAGGAC 
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TCTGGCAGGACTGGGTCTAGCACCTCTCCGTGGCCAGGCTTTTGTGTTTTCCT 
GTCTTGTCGTGGAATAGTTCTAGTATCTACAAAGGAAGAGAGTGTCACGACTC 
AGCTCCCGGCATGCACAGGCAGATCTGGGCTGGTTTCATGCTGCTTTCTGGCA 
CTGTGTCTTCCTGGCCAGGGGAGCCCGTCCCTCCGGGGTCCAGAGTCCTCTCG 

5 CGATGGCCTCGGCAGTGGTATCCGTCTCATTTCTAGGGAGTCCGTTTTACCTTT 
CCTTTGTTGCTTCTGGACCTTGAGCCATAGCTGGAAGAACTCACCTGTGGTTT 
CCTCCAGCCCTGACCAGGTTAACTGTGTGTGCTCCATGGTGGCTGGGAGTTTC 
CTGGGTAAGTGGTGTGAGGCACAGGCCTGCTGTCACTCTTCCCTCGTCTTCCC 
AGTTCTCCCAGCACTATGTTTTCCACAGTCCATCTGTCCTGCCTGACTGGAGG 

10 GGCCGTCTTCTCTGTAAACCAGACGTCCAGAGGCAGTAGGAAGGGTCTGCTTT 
GGGGTTTTTCTATTCTGTCCCACTGGTCTCCTGGACTTTTCCTTTACCAGGGCC 
CTACTGACTTGGTGGTCCCGGTACTAAAATCAATTTCATGATGTCACTTTGTG 
CCTCGGCAGGGCTTTCTTTTTCGGGGGTTTCCTGGTGGCGCTTGCAGCCTTGCT 
TTTCCCAGTGAACGTTGCCATCACCTTATCCAGCGCCGGGAAGAAACTCGAAG 

15 GCCGCGGGTGAATAAACTGGGGGAGACTGACATCTCGGTGGTCTCCAGGAGC 
ATGGTGTGTGTGTTTATGGGCAGCATTTTATGGTTTCATTATGTAGATTGCAG 
GCAGTTTAAGTCTATTCCTCATCATAATAATATTGAGAGAATAAGTTGTTATT 
GTTGCCATCACATAGGGAGTTTCTAGTCTATCATGTTTTCCAGGCACTTTCCTG 
GCGTCTGCGAAAGCTGTTAATTTCTGCTACAGCCTGTGGTCCTGCTACTTCTC 

20 GTGACTGTGGTCGTAGAAGTTTTTCCCTCTGTCCTTTGCATCTTTTGCGTTTCC 
AGATAAGGGATGCCTTCCAGGAGGGCAGATTGCTGCTTGGGTCTGTCCAGAG 
CTGGGGGTGGTACCTGCGGCCAGCGGGGCCCTCCTGGCTTCTGTGGGCGGGC 
CGGCTGCCAGGCGGGGGGTGCGCACCTCCCGTGCTCAGGAAGGGTCCTTCAG 
GTCCGATGCCTGGCGTGGTTTTACGGTGACCATGTGGATTCTGGTCAGGGGTC 

25 TTTCCTGCTTCTGTGAGGTGCTCCTGTCAGTGAGGGGCCCCCCTTGCCTTCCTG 
GAATAAACCCCCTTGGTCCTGGCGTGTGAGTTTCATGTCCTATCACATTGTAC 
TCTCATGCTTTGTTCAGACTTTTGCACTGCCCTGTGAAGTGGGGTGTGTGTGTC 
ATCTTGTGGTGCCATCTTGGTTGGGGTGAGGCCGTCCTTCCCCCACAGGGCGA 
TGTCCAATACCCTTGGCAGCATGGGCCTCATGTTTTATC^GGGTCCCCAAGGTA 

30 CCTGCCCAGGATCCCCAAGTCCTCCTCACCCACCACAGACCTGGCTCTCAGAT 
GAGCACCCAGCCAAGTCTCTGGCCATGTCTAGCAGGAGGATGTTCTGCCCCG 
GCCAACTGACAGGGTTGGCTCAACTGCAGGAGTGGGCACGGCATGTCCTTTG 
CTGGTTCTGCCATGGGGCGGGTCTGTGGGGGGTCCCTGACCTGGTAGTTGTCC 
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CTGGAGGGCCTGGGGCCAAGGATGCCAGTGGTTCCCAGGCTGTGAGAGCCGC 

GCTGCCAAACATTCCCCTGGAGAGTCTCCGTGTGAGGCCCTTTGTACCTCTGC 

AGGTGTGTGTACATGTGTGTATGTGAGTGCCCACATGCACGGGAATGTGTAC 

ACGCATGTGCGTGTTTATGGGTGAGCTGTGTGCTTAAGTGTGCATCTGTATCT 

GAGTATTATGTAAGTGTGCATACACATGAGCACGTGGGGACTTTCACGCATGT 

TTGGCTCTGTTTCAATGTGTGTACATGTGTGCACATGTGTATCTGAGCGTGTA 

CATGTGTGAATGTGGCCACATATAGGCACATTGTAGTGTGCGCCTGTGAGCAT 

GTATTGTATGTTTATGCATGGCTGTGTTGTGTGTATTTATGTATAGCTATGTGT 

CGCTGCGTGTGTGTACCTATGTGTATGTACGTGTGTTCATGCCTGTGCTGTGTG 

TGAGACGTGAGATCCTGCCTGTCCCATCCCTCACGTACTAACCTGGATCTAGG 

TGGTTAGCCCCTCGCCCCCAGGTGGCCTGGGACTTGCCTGGTTTTAAATGGAG 

CCCCGGTGCTAACGTGAGCCCACCCAGGTGCTCCTGCCCAGCCAGGGCTGTCT 

GTCAGCACCACAGCGTCCAAGTTCAGGTCCTCACTTCCTAGGTGGTTATCTCC 

CAGGCAGCTGGGAGGGCGAAGCCTGGAGAAGCCTGCACCCAGGCTGGACCT 

GCAGGGACCTTCAGACGTGCCCTGGGCTCCACATGCCCGTCTGCAGCTCGAG 

AATTAGACGTGCCCTGGGCTCCACATGCCCGTCTGCAGCTCGAGAATTAGAC 

GTGCCCTGGGCTCCACATGCCCGTCTGCAGCTCGAGAATTAGACGTGCCCTGG 

GCTTCATGTGCCCGTCTGCAGCCCTAGAATCACCAGCAGCACCCCAAGGCCCC 

CATGCGGCATCTAAACCAGGCAGGGGACAGGGGCATGGAGCAGACCAGGTA 

GGGTTGCTCCCTAGGATGCAGCCCTGCCCAGGCAGGAGGGCAGACCCTACAC 

AGTGCTGAGCTGTGCCCCCAGGGAAGCCTCCAAGATGGTCCTCATCCCAGCA 

GCCCTGCCCACTGTCTGCGCATCCCTGGGGTCTGTGCGCCACCTGGTGGCTAT 

TTGCTGAACTGTGGCCACAAACCCCTGGTCAGCAGGAACCGGCCCTTTCACTG 

CCCTGCAGCTCCCTCTCCTCCGGGGTGGCATGGCAGAGAAGCAGCCCGTGGC 

CCCAAGCATAGAGCCCCCTTTCTGCTGTAGTCCCCCTTGCAGTCCCCTCTGCCT 

TTTCCACTGCTTGGAAGGCAGAGAGACCCTGGCCGCCTCCCTTGTCCCTTTAT 

CTGTCATGCCAGCTGCCTCCCTGCCACACCTTGCGTCTCACCAGGCAACCCTG 

CAAACAGCAGCTGGGGAGGACTTTCTAAGACAAAAACCTGATCATTCCAGAA 

GCTCCCrGTGACGTTTAGGCTACATTCCCCAAGGTGGTGCCACGTTCAGGCCC- 

ATCACCTGTAACACTACCCACCCCACCTCACCCAGCCTGGGCTCACTCACTCC 

AGGCCAAGCCTTGGACATTCCTGGCTGCCCCAGGGCCTTTGCAAGCGCTCTTC 

TCTGGACCTGGCTGTCCTCGGGAGTGTCAGCACTCAGCCTCTTGTCAGGTTTC 

AGGCCTCTGCTTGGCACTATCCTGCACACCATCCCCGCCGACCTGCCCTGTTC 
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TGGGTTCATGGGGCTCTGGGAGGCTCACAAGCATCTGTCTGCCTCAGGCCATT 
CCACACTCCCTGAGGGTGGGACCCCGTGGCTGCGCTAACCCCACAAGCTCTG 
CATGGCGCTTCCTGGCCGTGCATGTCCCTCCCTGCATGAAATGCATAGGCCAG 
AGTTGCCCAGCGAGGATGACCCAGCTTGGTATGTGCTCCCGGGTTCAGCTGCC 

5 CCTGCGTGATGGAGACACCCTCCCACTCGCCTTGCCACACTTCCCAGCAGTCT 
CTCGAAGCACTGGCCGCGTGGAAACAAATCCTTCAAGGGGATTTGAGGGTCC 
AGCTGTCACTCAGTCATGGCCATGGTGGGGGCACGTTGGGGACACAGCAGAG 
AGTGAGGCAGCCGTGGGCCTGGCCTTGACAGGCGGATGTACAGACACATCAG 
GGCCGAGACTGAGGCGTGGGCTCTGGGTGGGGAGACAGTGACCACTTGCTTG 

10 GAGGAGGGGGCGCTCAGGTGGGAGGGGCACAGTGAGAAGGAGGTAAGCTGG 
TGCTGGCGGGGCAGGCAGGGACCCCTCCCAGGGCTCACAGCTTCCAGCGGAC 
AGCAGGCACAGGGCTGTGGGGATCAGAGCACAGGCAGAAGTCACCTGACAG 
CAGGTTCCCGAGGCTTCAGGCCTGGAGGAGTGAGGACGGCTGTTGGGTTCTT 
GATTCCAGCAGAGGGAGGCTGTGCAGGGTTGCAGGAAGGACGGTTGGGGGC 

1 5 GCTTCGGGG AGGTACGAGAAGCAAAGGtGTTTGCAGCGCTTGGGGCCAGGCG 
GCGGGAAGGCCTGTAGACGCCAGAGGGCGCTCGTGCTACATGCAACTCAGGA 
GCTGCTGGCACCCGCCAAGGAGTGCTGAGCGCGGGGAGCCCAAACCACAGCC 
TTGGGGAACCCAGGAGGGGCCGGGCGGGTGCAGGACCCGCATGAGGGATTC 
CGAGAGCACGATGTTTAAATGTTGGGGGAAAAAATGGATCAAAGCGATGTGT 

20 TCCCAGGGAGCCCCTGTTTTCTCGTTTCITrTCGTACGCCCAGCrTTGCAGTGC 
CAGAAAGCTGTTATCTCAGCACTGCTAAGGAAACCCCAGCGCGGCCTCGGAG. 
GGATTAGGCATTCTGCTGAACCAGGGATATGCGGTGGGCCTGAAAAAGCCAT 
TTCGGGAGGAATCTGGTCTCGGCTGGGGTCCGGCCGGCACCGTGAACACCAC 
GTGAACACCACGTCGGAAATCTGCTGGGCAGGGAGATTTCCTCAAGGCCTCC 

25 CAGGGAGGAGAGCCACTTCCTGGGCTGCAAAGAGGGCCCCAACCTGGCTTCT 

gaattcagagggggctgtgtgtgcacgtgtgtacgtacgtgcatgtgcacgt 
ctgtgtacccatgtctgtggaagtggcacacgtgtgcatgtgtgcactcagtg 

TACGTGTGCGTGCCTGTGGGTAGCTCCGTGCGCACATGCGTACACATATGCCT 

ggtgtgtgtgtgc^:acgtgtgtctgacgtgtgtgtgctttgcgtgttcttgcc 
30 tgggaaggagttcccggtttgggtggctctcctgtttcttgcggaagccagct 

gctgctagaaactggccccctgctctctgcagtaagagaccctgccctggccc 
• aggtgagggcctggcttttgtctccaggatcacagcttttgtcaaactccaca 

caaagagagaggcactcaaaagcacaacacaaacacggctagtccgcgcctt 
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TGGTGGTGAACATACCAGGGAATTGTCCTGCAATGCCTGTGGTTAAAAAACC 
CCGTGAGCTGCGGCCCCGGAGGCCTAGGGGATGAGGTCATCGCTGCGGCTTC 
CTCCCCTCTGCTGAGGCTCCCTCCGCCCCGCCCGGTGGCCCACCCCTCCCGCC 
AGCTGCAGCCTCCACTGCCTGGGTTTCTGCTG'CGCCTTCAGCACACAGCGAGG 
CCTCAGTTTCCCTCCCAGCTCAGAGCAAGGCCTCAGTTTCCCTCCCAGCAGCA 
AAGTGAGACCCTGTAGGGAGTCTCTGGGCCTCCGTGACGGCCCTGGGCGGCC 
CTCTCTGTGGAACAAGGTCGGGTCCTGTCCTCAGCGCCCTGGCCTCCACTTGC 
TCAGGGCGCCCGCAGGCCCGCAGGTCCTCACCTGGTGTTTTTCACCTGGGACC 
CGCAGCTGCTCTGATGAGGAGCCTGCTCTGCGCACGGCCTGGGGCGGTTTCCC 
GCAGCACCATTTCTGCCGCTGCCGGGAAAAGACGCAGAGCAAAGGCACCTTT 
GCCCAGGTTGGGGAACCAGTGGGTGACTCAGGATTTTCCTCCACCCACCCCG 
AGGTCTGACTCCCTCTGCAGCTGCCGCCCGCTGAGGGTTGTCTTCCTCCCTGC 
CTGCCTCGCCCTGCCTGGTGAGCTCCTATGGGTCTGACGAAGCCCCGTCCTGC 
ACACTTGCTTCGGGGGCAACTCCCCACTAGACTGGTTGCCCAGGGACGGGGG 
TACCCACTCCCTCCTCAGGGCTGCAGGGTGCTCTGGGAAGAGTTGTGTCCCTC 
TGTGGGCCTGTGGGCTCACTTGCCTTTTTGTCCCTGTCACTTGGGAGCTACCCA 
GCTAAGACCGGTCCCCACCACCCAGGCCACTCTGAGGCTTCAGAACGCGGTC 
GTCTTCTCCCTGGCAGGGTTGTTTCeGCCTGCATTTGTTCAGCCAGCAAATCTT 
TATTAAGCACCTACTGTGTGCCAGGATCTATGTGCAAGCAGCCCATGGGCTCA 
TGAAAGGCAGAGCTGGACTGAGATTCCGCACCCCTGGGATCCTCACTGCAGG 
GCCCTGTGTATGTGCGGGCTGGAGCCCTGGTTCCTGTCTGCACTGAGCCCCTG 
GAGCTAGGCCCAGAGGACTTGACAGATGTGGGCCCTCTGTCTGCACAGCCCC 
CAGGGCACGGGGTGGCCTGCAGGTCTGCCCAGCCAGACTTCAGGTCACTGCC 
CCATGAGCCTGTAGGAGAAACTTCTCACTCATCTGTCCTGAGACCTGCCTGGA 
GCCCACCCCAGGGAGACAGAGGCTCTCGGAGCCTGTTCAGGGTGAAAGGACG 
CTAGACACTGGCCTGCAGGCTTTGGGCGGGTAAGGGTAGCGAAGGTGAAAGT • 
GGACCAGGGGCAGGTGGAGCAGGGTAGGCCCAAGTGTCTGTCTGGGGCAGCT 
GGGGACACCCAGGTGGGCTGAACTTCAGCTGCCACCCAGCTTCAGCTGCCAA 
GGCTCCTCCCGGGTCAGGGTCGACCCCK^ATGTCCCGGCAAGAGTTCCAAGA 
30 AGAAAGGCTCAGAATCCACCAAGCGGAACTTGAAGCCACTGCCCTGTGGGGA 
CAGGAGAAGTGACTTGGCCGGCTCTCCGGAGCCCAGTGTTGATCACTGAGGA 
CCACCGGATACAAGTCCCTCCTGCCTTCCCCTTTAGTGACACTGAAGCTGTTG 
AAATCCCCATCCACCAGGAGCTCCTGGTCCTGGGGAGCCCTGTGCAACCCACT 
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CACTGCCTTTCAGAGCCCTGTGGCGTCTCTCTCATGCGCTGG.CCGTAGAGCTT 
CCTGAAAGCATCTCTATCACCCACTCTGCACCTGAGAGAGGGCGGTCAGGTG 
ACTGGTACCCCCTGAAGACCTGGGACCAGGCCCCCTGCCTCTGGGAATCCATC 
AAAATGCTCCCTTCTGAGGCCGCTTTGCTGGGCGTAGGATCAGCCTTGCTTTG 

5 AGGGCTGTTTAGGAGAGGAACCCCGGCGCTGCTGCTCCCCGGAGGCAGCCAG 
TGCTTGTGATGGGCACCCTGCCTCCAGTGTTGTGCTTCAGGGTGTGGGATGCA 
CAGCTGGGGGCAGGGGCGCTGCTGAAGCCACAGTCTCAGAGCCTGAGGCCCA 
CGGGGCATGGCATGAAGACCCGGTGGTTCTGTTCCCCCGGGACAGCCTTGGT 
CCAGCCTCACTGGCCACATGCTCTGCCGAGCAGACCCGCTGCTGTCCCCAGCC 

10 ACCCAGCCCCATGCACAGGCCACGCGGCGACAGGGGCTGCTGAAGGAAGGCT 
GGTCCGCTCCCTGCCTGTGACAGGAGCTCAGGCTCAGGGCAGCAGGGGCGCT 
CAGCTGGGCCGCGGGTCCACGGTGTTAAGGGATTCTAGAAATTTCCTGCTGTG 
CCAGGCTGCAGCAGAGACTCCGGCCCCACCAGGCTTCCTGTACTTGGTGATA 
AACACACCATCCGCACTGCCTGCGGCTCTTACAGGAGTCTCTCGTGCACCATT 

15 GCTCAACCCCCGAGAGTTATGGAAGGAAGGGAGGCTGGGGTGGGTGCCCCGA 
GTGCCTTGTTGGAGTGGAGGCTGGCGCTGCCCTGCCCCACAGGGTGGCTGGC 
AGGGCTCAGCAGTTGCCTGGCCTGTGGGGGCAGAGGACCTGGGGAGACATGC 
TGAGCCCTCCCAGCGAGACACTGAGGGTCGGGAGGGTAACATGTGATTTTGA 
GGCCACCCGCTACAGCTTCTTGGGACCTCCCTCTCTCAGGCGGGTCTGGAACC 

20 CAACAGAGCTGGTTTGGGCCACAGCAGTGGCTTCCCAGGCTGGGCTTCAAAA 
CCGGACACAGGGCGTGGCTCTCCCTCCCAGCCTGCCAGAGGGACCGCTGGGC 
CTATCTCCAGCCACGTGGGTCGCGAGAGTCGACCTGGGCTGCGTGATGGGGG 
ACCCCAGGCTGGGCGGCTTAAACCGTGGAGGTCGCCCTCTCACAGCTCTGGA 
CGATGGAGGGCCGAGACCCAGGTGTGGGCAGGGCTGGTTCTTCTGAGGCCCC 

25 TCCCGGCTTCTGAGGGTGGCTAGCAACTGTCTTCTAGATGTAGTACCCCAATC 
CCCCCCAGTCCCTCCCTTTACCTCACAGGGCATCCTCCCTGCGTGTGAGTCTG 
GAGCCAAATTTCTCCTCTATGTAAGGACACCAGTCTTACTGGATTGGAGGCTA 
CCCTACTCCACTGTGACCCCCCTCCTAACTGATCATATCTGCAGGGACCCCAT 
TTCCAAACAAGGTCTTACTCTGAGTACTGGGGATTAGGGCTTC>/rCATATGAA 

30 TTTGAGGGCAACCCAGGGGTCTCCAAACCATGGCCCACAGGGTGGCTCCCTG 
TGTGTGTAAATAAAGCTTTATtAGCACACAGCACCACATACTTGAGGGCCTAA 
TGCCTCTGGCTGCTTTGTGCCATGGCAGATTTGAGTCCTGCCATGGAGACCAT 
GCAGCCCACAGGGCCTAAGATTTCTACTACCTGACCTTCCAGGAAAGGTTTGC 
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CACGCTATTTATTTAGCAGGATAATTTTGGAAAGAGCAGTTGCTAGGGGCATT 
TAAACTTGCAGTCGAATTAGTAACAATTTTATTTTTTGGGAAAATAGAAGTGG 
GAGAGCGTGTGCCCCGGGCCAGCCCACCGTGCCCCACGTTTCTACAGTGCCC 
AGGTTGGCGCAGGGCCTTTTGGTTCCACAGTCTGTGTGTTTCTTCCTGGGTGTC 
5 TGAATTTTGATTTAATCTCAGGCTTGTCAGAACGAGCAGCTCCCAGGCACCCT 
CTGCCAGGTCAGCAGTGGCTTTCATCTTCTGCCATTTGCTCTGCACGTTTCTCT 
CTATATGCAAAAACCTATATGGGTTTTGTGGGGTTTTTTGTTGTTGGTTTT^ 

TAGTGGCATACTCACGGCTCACTGCAGCCTCGGCCTCCTGGCCTCAAGTGTTC 

1 0 CC ACTTC AGCCTCCTGAGCAGCTGGGACTGTGGGTGCGTGCCACCATGCCCCG 
CTAATTTTTGTATTTTTTGTAGAGATAGGGTTTTGCCATGTTACCTAGGCTGGT 
CTTAAACTCCTGAGTTCAAACAATCCTCCCACCTCGGCCAACCCAAAGTGCTG 
AGATTATAGGCATGAACCACTGCACTCAGCCTGCTTTGTTTTTATCCTGAGTC 
ATTTGAGAGTAAGTTGGAGGCAGTATGACCGTTTCTCCGTATAAAGCCCGTGT 

1 5 AT ATCCTCTGGAATGAGGACCTTCAACTTAGCCAGGGTGTCCGACTCTCAGGA 
CACCGCTGTCCATTTCACAGCCTGCATTTAAGGCTCGTCACTGGTCCCAGTGA 
GTGTGGCTGGTTTTTTCTCATTGCACATGGCTGTCACGCGGTTTCAGCTTCTTT 
GACCTGGAGCAGCCCTGCCCGGCCTTCCCTTCTTGGTCTCTCTCTTGGGCAGG 
GAAGGGCGGTGGACATGGAGGCTTGGGGACAGTCCCCTGGCCCCACGGATGT 

20 TGTGTGGACGAGTTGGAGGGCTCACAGGCCCCTCTGTGGGGCCCAGGGCAGA 
GGCAGCAGGCAGGCTGTCTCCACCACGATGGACATCATTTATGCCCCAGCCA 
GGTCCCCTGTCCCAGGACCGCGGTTGACAGGGTTTGGATTGGCTGCGTTCCTG 
TCTGAAGGCAGCTGCTTGGGAGGAGGACAGCTGACATTCTTTCATGCGAAAT 
CTGCTTGCGGGAGAGTCGTGAAAGTGGGAATCCTGGGCTGGAACCGGAACGT 

25 TCCCCGAGTACATAGGAATCCTGGGCTGGAACCGGAAACTTCCCTGAGTATA 
CGGGGCCTTGTGTGTGGAGGGGCCCTACGGTGAATCGTTTTCTGTAACAAAG 
GCTTCCTCTGTGCAGCCCACACACGGGCCTCACGTTATCTTCTGCAGGCCAGA 
ATTGTCATATATATCAAGTTTGTTCAGAACAAGATGGGGTGGGGTGGGGGGC 
AGTGVGGGGCCTGGGGCAGGAGGTAGAGGAAACTGCAAGATTTTTTAAACCT 

30 TCTCCAAAAACACGTAACCCGATTCTGTCCAAGCGATTTGTGAGAATGGGGTT 
GACACCCCTGGGCTGGAACCTGGGAACCACCAGTCAGCGTCTTTCCTCCAGG 
GAGGCTGTGCAGAGGGAAGGCAGTGAGCCCCACCTGGGTTTGAGGAACCTGG 
GGCTTCAAGGGAGGCGGTCTCAGCCCATGGTGCTCCGGCCAGGGGTGGGGCA 
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GCCGAGGCATGAGTGGATGATGTCCCGTCGGCAGCCCGTGGTCCCTCTGGGG 
GGCAAGGTGCCTGTTCCCCATTGCTCCTCCACCAGAACACCTCCTCCAGATCT 
CCGCCAGGTGCGCCCCCTGAAGGAAAAGGGGATGGGGGCCTGGCTCGTGCCT 
GGTGGCAAGTCCTCTTGGGCCCCAGAAAGACCCCATAATGGCCAGGTAGAAG 

5 CTCCTACCTGCTGCTGCCTGAAGAGTCTGGGGCCCTGTCGGCTGGTTCCATCG 
TCTTCCCTGTTTTCAAGCTACAGGAGCGCAATCTTTCCAGAGCATTGAGTGGC 
TAGAGGAACGGGCTTACGCTAGGCAGGCGCTGGCATCTTAGACAATACCTGA 
CATCTTTGCGGGAGCTCTGGTGAGAGGCCCCACAAGCGTGTGACCCTGTGCG 
GACTCCACCGCGGCAGCCTGGGAGCCAAGCCGATGGCTGTGCTGGCCGAAGG 

10 GAACACCGCCATGGTGCTCTGCTCTCTGCAGCCCTGGGTCTGGTTGGAAGTTG 
CATCCCTGGTGAAGGGCGGCCGCGGCTrtCCAGCTCTCCCAGGGCAGTTCCCG 
TGTTTTCTCTCCCGCCTTCCCCATAGTCCAGGACAAAAGTCAGACCTTGGTGG 
GGGGTGCGCCTGCTGCAGGGGCCCCCAGGCTTCTCCATTCGGGTTTAGTGGGA 
TTAGAGTGTGCGGCCATGGGGACAGGTGTTTCCTTTATGTTCTGCCTGAGGGC 

15 TGGAAAGAGCCTCATCAGTGAGTGCCACGTTCGGTGGGGGTCTCTGGTGCAG 
GGGAGCCGGCGGGCCGGGTTGGGTTATGTCCTTGTTTTCACATCTGACCCTGG 
CCAGCTGGAGAGCCTCGCTTGAAGCGCAGGGCAGCAGCTGACACAACACAAC 
GGGACGGGCTGAGCAGGCCGGGCGCAGGCAGGCTTGGGGGAGCTGGCACTG 
GCACTGCACGTGGGGCTCCTAGGGGGCTGCTAGGGAAGTCATTGGGGGATAA 

20 TA 
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SEQIDNO:7 
Human ltipc5 (mtrl) cDNA sequence 
gi|67151 16|gb|AF177473.1|AF177473 Homo sapiens MTR1 (MTR1) mKNA, . 
complete cds, alternatively spliced 
GAGGCCACCATGCAGGATGTCCAAGGCCCCCGTCCCGGAAGCCCCGGGGATG 
CTGAAGACCGGCGGGAGCTGGGCTTGCACAGGGGCGAGGTCAACTTTGGAGG 
GTCTGGGAAGAAGCGAGGCAAGTTTGTACGGGTGCCGAGCGGAGTGGCCCCG 
TCTGTGCTCTTTGACCTGCTGCTTGCTGAGTGGCACCTGCCGGCCCCCAACCT 
GGTGGTGTCCCTGGTGGGTGAGGAGCAGCCTTTCGCCATGAAGTCCTGGCTGC 
GGGATGTGCTGCGCAAGGGGCTGGTGAAGGCGGCTCAGAGCACAGGAGCCT 
GGATCCTGACCAGTGCCCTCCGCGTGGGCCTGGCCAGGCATGTCGGGCAGGC 
CGTGCGCGACCACTCGCTGGCCAGCACGTCCACCAAGGTCCGTGTGGTTGCTG 
TCGGCATGGCCTCGCTGGGCCGCGTCCTGCACCGCCGCATTCTGGAGGAGGC 
CCAGGAGGATTTTCCTGTCCACTACCCTGAGGATGACGGCGGCAGCCAGGGC 
CCCCTCTGTTCACTGGACAGCAACCTCTCCCACTTCATCCTGGTGGAGCCAGG 
CCCCCCGGGGAAGGGCGATGGGCTGACGGAGCTGCGGCTGAGGCTGGAGAA 
GCACATCTCGGAGCAGAGGGCGGGCTACGGGGGCACTGGCAGCATCGAGATC. 
CCTGTCCTCTGCTTGCTGGTCAATGGTGATCCCAACACCTTGGAGAGGATCTC 
CAGGGCCGTGGAGCAGGCTGCCCCGTGGCTGATCCTGGTAGGCTCGGGGGGC 
ATCGCCGATGTGCTTGCTGCCCTAGTGAACCAGCCCCACCTCCTGGTGCCCAA 
GGTGGCCGAGAAGCAGTTTAAGGAGAAGrrCCCCAGCAAGCATTTCTCTTGG 
GAGGACATCGTGCGCTGGACCAAGCTGCTGCAGAACATCACCTCACACCAGC 
ACCTGCTCACCGTGTATGACTTCGAGCAGGAGGGCTCCGAGGAGCTGGACAC 
GGTCATCCTGAAGGCGCTGGTGAAAGCCTGCAAGAGCCACAGCCAGGAGCCT 
CAGGACTATCTGGATGAGCTCAAGCTGGCCGTGGCCTGGGACCGCGTGGACA 
TCGCCAAGAGTGAGATCTTCAATGGGGACGTGGAGTGGAAGTCCTGTGACCT 
GGAGGAGGTGATGGTGGACGCCCTGGTCAGCAACAAGCCCGAGTTTGTGCGC 
CTCTTTGTGGACAACGGCGCAGACGTGGCCGACTTCCTGACGTATGGGCGGCT 

gcaggagctctaccgctccgtgtcacgcaagagCctgctcttcgacctgctgc 

AGCGGAAGCAGGAGGAGGCCCGGCTGACGCTGGCCGGCCTGGGCACCCAGC 

aggcccgggagccacccgcggggccaccggccttctccctgcacgaggtctc 

CCGCGTACTCAAGGACTTCCTGCAGGACGCCTGCCGAGGCTTCTACCAGGAC 
GGCCGGCCAGGGGACCGCAGGAGGGCGGAGAAGGGCCCGGCCAAGCGGCCC 
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ACGGGCCAGAAGTGGCTGCTGGACCTGAACCAGAAGAGCGAGAACCCCTGG 
CGGGACCTGTTCCTGTGGGCCGTGCTGCAGAACCGCGACGAGATGGCCACCT 
ACTTCTGGGCCATGGGCCAGGAAGGTGTGGCAGCCGCACTGGCCGCCTGCAA 
AATCCTCAAAGAGATGTCGCACCTGGAGACGGAGGCCGAGGCGGCCCGAGCC 
5 ACGCGCGAGGCGAAATACGAGCGGCTGGCCCTTGACCTCTTCTCCGAGTGCT 
ACAGCAACAGTGAGGCCCGCGCCTTCGCCCTGCTGGTGCGCCGGAACCGCTG 
CTGGAGCAAGACCACCTGCCTGCACCTGGCCACCGAGGCTGACGCCAAGGCC 
TTCTTTGCCCACGACGGCGTTCAGGCCTTCCTGACCAGGATCTGGTGGGGGGA 
CATGGCCGCAGGCACGCCCATCCTGCGGCTGCTAGGAGCCTTCCTCTGCCCCG 
10 CCCTCGTCTATACCAACCTCATCACCTTCAGTGAGGAAGCTCCCCTGAGGACA 
GGCCTGGAGGACCTGCAGGACCTGGACAGCCTGGACACGGAGAAGAGCCCG 
CTGTATGGCCTGCAGAGCCGGGTGGAGGAGCTGGTGGAGGCGCCGAGGGCTC 
AGGGTGACCGAGGCCCACGTGCTGTCTTCCTGCTCACACGCTGGCGGAAATTC 
TGGGGCGCTCCCGTGACTGTGTTCCTGGGGAACGTGGTCATGTACTTCGCCTT 
15 CCTCTTCCTGTTCACCTACGTCCTGCTGGTGGACTTCAGGCCGCCCCCCCAGG 
GCCCCTCAGGGCCCGAGGTCACCCTCTACTTCTGGGTCTTTACGCTGGTGCTG 
GAGGAAATCCGGCAGGGCTTCTTCACAGACGAGGACACACACCTGGTGAAGA 
AGTTCACACTGTATGTGGGGGACAACTGGAACAAGTGTGACATGGTGGCCAT 
CTTCCTGTTCATCGTGGGTGTCACCTGCAGGATGCTGCCGTCGGCGTTTGAGG 
20 CTGGCCGCACGGTCCTCGCCATGGACTTCATGGTGTTCACGCTGCGGCTGATC 
CATATCTTTGCCATACACAAGCAGCTGGGCCCCAAGATCATCGTGGTAGAGC . 
GCATGATGAAGGACGTCTTCTTCTTCCTCTTCTTTCTGAGCGTGTGGCTCGTGG 
CCTACGGTGTCACCACCCAGGCGCTGCTGCACCCCCATGACGGCCGCCTGGA 
GTGGATCTTCCGCCGGGTGCTCTACCGGCCCTACCTGCAGATCTTCGGCCAGA 
25 TCCCACTGGACGAGATTGATGAAGCCCGTGTGAACTGCTCCACCCACCCACTG 
CTGCTGGAGGACTCACCATCCTGCCCCAGCCTCTATGCCAACTGGCTGGTCAT 
CCTCCTGCTGGTCACCTTCCTGTTGGTCACCAATGTGCTGCTCATGAACCTGCT 
CATCGCCATGTTCAGCTACACGTTCCAGGTGGTGCAGGGCAACGCAGACATG 
TTCTGGAAGTTCCAGCGCTACAACCTGATTGTGGAGTACCACGAGCGCCCCGC 
30 CCTGGCCCCGCCCTTCATCCTGCTCAGCCACCTGAGCCTGACGCTCCGCCGGG 
TCTTCAAGAAGGAGGCTGAGCACAAGCGGGAGCACCTGGAGAGAGACCTGC 
• CAGACCCCCTGGACCAGAAGGTCGTCACCTGGGAGACAGTCCAGAAGGAGA 
ACTTCCTGAGCAAGATGGAGAAGCGGAGGAGGGACAGCGAGGGGGAGGTGC 
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TGCGGAAAACCGCCCACAGAGTGGACTTCATTGCCAAGTAQCTCGGGGGGCT 
GAGAGAGCAAGAAAAGCGCATCAAGTGTCTGGAGTCACAGATCAACTACTGC 
TCGGTGCTCGTGTCCTCCGTGGCTGACGTGCTGGCCCAGGGTGGCGGCCGCCG 
GAGCTCTCAGCACTGTGGCGAGGGAAGCCAGCTGGTGGCTGCTGACCACAGA 
5 GGTGGTTTAGATGGCTGGGAACAACCCGGGGCTGGCCAGCCTCCCTCGGACA 
CATGAGCTGCTTGGCCTGCCACGTGTGGGGCCACCTCTCTTCAGTTGGCCACC 
CTGCACGTTGTGCACTGACCTTTGCCGACCTCCAGCGGAACCCCCCAGGGGGC 
ACCAGCCCCCCAGCAGACAATGGCCCTCCTGGTGCCTCACCACAGACCCTCA 
CCCAAAGGAACCGCTCCTTGTCCCTCCTGGCCTCCCCGGAGGCACAGCAGTGT 
10 CATGGGGCTGTCTCCCCTGACAGGCACAACTCCCCGGGCAGAAAACGTGCCC 
CACCGCATCCCTACCTGGAAACTGACCAGCCTGCACTGTGGAAAAGCTGGCC 
CTGTGGCGTGACGGGGGAGCACCCCCATCCAGACTGCGAAGCTGCTCTGGGT 
CTGCACCCACCCCTGCCCTGACTTGTGTTGCCTGACAAGAGACT 
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SEQ ID NO:8 
Predicted human ltrpc6 amino acid sequence 
translation from GI67 16116 
MQDVQGPRPGSPGDAEDRRELGLHRGE\WGGSGKKRGKFVRVPSGVAPSVLF 

5 DLLLAEWHLPAPNLWSLVGEEQPFAMKSWLRDVLRKGLVKAAQSTGAWILTS 
AIJIVGLARHVGQAVPJDHSLASTSTKVRWAVGMASLGRVLHRRILEEAQEDFP ' 
VHYPEDDGGSQGPLCSLDSNLSHFILVEPGPPGKGDGLTELRLRLEKfflSEQRAGY 
GGTGSIEIPVLCLLWGDPNTLERISRAVEQAAPWLILVGSGGIADVLAALVNQPH 
LLWKVAEKQFKEKFPSKHFSWEDIVRWTKLLQNrrSHQHLLTVYDFEQEGSEEL 

10 DTVILKALVKACKSHSQEPQDYLDELKLAVAWDRVDIAKSEIFNGDVEWKSCD^ 
EE VMVD AL V SNKPEF V11LF\TDNG AI) V ADFLTYGRLQEL YRS VS RKS LLFDLLQR 
KQEEARLTLAGLGTQQAREPPAGPPAFSLHEVSRVLKDFLQDACRGFYQDGRPG 
DRRRAEKGP AKJRPTGQKWLLDLNQKSENP WRDLFLWAVLQNRIIEMATYFW 
MGQEGVAAALAACKILKEMSHLETEAEAARATREAKYERLALDLFSECYSNSEA 

15 RAF ALL VRRNRC WSKTT CLHL ATE AD AKAFF AHD GVQ AFLTRIWWGDMAAGTP 
IOILLGAFLCPALVYTNLITFSEEAPLRTGLEDLQDLDSLDTEKSPLYGLQSRVEEL 

VEAPRAQGDRGPRAVFIXTRWlfcKFWGAPVTVFL^ 

FRPPPQGPSGPEVTLYFWWTLVLEEIRQGFFTDEDTFiLVKKFTLYVGDNWNKCD 
MV AIFLFWGVTCRN1LPS AFEAGRTVLAMDFMV^ 
20 MMKDWFFLFFLSVWLVAYGVTTQALLHPHDGRLEW^ 

DEroEARVNCSTHPLLLEDSPSCPSLYANWLVILLLVTFLLVTNVLLMNLLIAMFS 
YTFQ W Q GN ADMF WKJQR YNLrVE YFIERP AL APPFILLSFILS LTLRR VFKKE AEH 
KR£HLERJ}LPDPLDQKVVTWETVQKENFLSK^ 

AKYLGGLREQEKRIKCLESQINYCSVLVSSVADVLAQGGGPRSSQHCGEGSQLV 
25 AADHRGGLDGWEQPGAGQPPSDT 
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